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HEMOGLOBIN RECEPTORS FROM NEISSERIAS 

This invention was made with government support under National Institute 
of Health grants R01 AI32493 and R01 AI22933. The U.S. government has certain 
rights to this invention. 



1. Field of the Invention 

This invention relates to hemoglobin receptor genes and the proteins encoded 
therefrom of certain bacterial species, particularly species of Neisseria bacteria. 
More particularly, this invention relates to hemoglobin receptor genes, polypeptides 
and peptides useful for preparing vaccines and antibodies against Neisseria, and 
methods and means for producing such peptides and polypeptides in vitro. Also 
provided an. diagnostic and therapeutic methods and reagents useful in detecting and 
treating Neisseria infection and methods for developing novel and effective anti- 
Neisseria agents. 

2. Background of the Invention 

The Neisseriae comprise a genus of bacteria that includes two gram-negative 
species of pyogenic cocci pathogenic for humans: Neisseria meningitidis and 
Neisseria gonorrhoeae. N. meningitidis is a major cause of bacterial meningitis in 
humans, especially children. The disease characteristically proceeds from 
asymptomatic carriage of the bacterium in the nasopharynx to invasion of the 
bloodstream and cerebrospinal fluid in susceptible individuals. 

Neisseria meningitidis is one of the leading causes of bacterial meningitis in 
children and healthy adults in the world. The severity of the disease is evidenced 
by the ability of meningococci to cause the death of previously healthy individuals 
in less than 24 hours. N. meningitidis has a polysaccharide capsule whose diversity 
of component antigenic polysaccharide molecules has resulted in the classification of 
ten different serogroups. Of these, group A strains are the classic epidemic strains; 
group B and C are generally endemic strains, but C occasionally causes an epidemic 
outbreak. All known group A strains have the same protein antigens on their outer 
membranes, while group B strains have a dozen serotypes or groupings based on the 
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presence of principal outer membrane protein antigens (as opposed to 
polysaccharides). 

Survival of a pathogen such as N. meningitidis in a host depends on its ability 
to overcome a battery of host defense mechanisms. One nonspecific host defense 
5 mechanism against microbial intruders is to limit the availability of iron in tissues 

(Weinberg, 1984, Physiological. Rev. 64: 65-102), because iron is a necessary 
nutrient for most microbial pathogens. The vast majority of iron in the human adult 
is located intracellularly in the form of hemoglobin (76%) or ferritin (23%). The 
remainder can be found extracellularly bound to host iron-binding proteins such as 
10 transferrin and lactoferrin (Otto et a/., 1992, Crit. Rev. Microbiol 18: 217-233). 

Pathogenic bacteria have adapted to this iron-limiting environment by 
developing highly specific and effective iron assimilation systems. A large number 
of these bacteria secrete siderophores, small, non-protein iron chelators which, due 
to their extremely high affinity for iron (III), scavenge trace amounts of iron(III) 
15 from the environment and shuttle the iron back to the bacterial cell (Baggs and 

Neilands, 1987, Microbiol. Rev. 51: 509-518; Braun and Hantke, 1991, in 
Winkelmann (ed.), Handbook of Microbial Iron Chelates, CRC Press: Boca Raton, 
Fla., pp. 107-138.). 

Alternatively, some bacterial pathogens, like Neisseriae species (Archibald 
20 and DeVoe, 1979, FEMS Microbiol. Lett. 6: 159-162; Mickelson et al. , 1982, Infect. 

Immun. 35: 915-920; Dyer et al., 1987, Infect. Immun. 55: 2171-2175), 
Haemophilus influenzae (Coulton and Pang, 1983, Curr. Microbiol. 9: 93-98; 
Schryvers, 1988, MoL Microbiol. 2: 467-472; Jarosik etaL, 1994, Infect. Immun. 
62: 2470-2477), Vibrio cholerae (Stoebner and Payne, 1988, Infect. Immun. 56: 
25 2891-2895; Henderson and Payne, 1994, /. Bacteriol. 176: 3269-3277), Yersiniae 

(Stojiljkovic and Hantke, 1992, EMBO J. U: 4359-4367) and Actinobacillus 
pleuropneumoniae (Gerlach et al., 1992, Infect. Immun. 60: 3253-3261) have 
evolved more sophisticated mechanisms to sequester iron from the host. These 
pathogens can directly bind host's iron-binding proteins such as lactoferrin, 
30 transferrin, and heme-containing compounds, and use them as sole sources of iron. 

The importance of iron in the virulence of N. meningitidis was demonstrated 
by in vivo studies using mice as the animal model system (Calver et al. , 1976, Can. 
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J. Microbiol. 22: 832-838; Holbien et al., 1981, Infect. Immun. 34: 120-125). 
Specific iron-regulated outer membrane receptors have been shown to be involved 
in the binding and the utilization of lactoferrin- and transferrin-iron in Neisseriae 
(Schryvers and Morris, 1988, Infect. Immun. 56: 1 144-1 149 and Mol. Microbiol. 2: 
281-288; Legrain et al., 1993, Gene 130: 81-90; Pettersson et al., 1993, Infect. 
Immun. 61: 4724-4733 and 1994, J. Bacteriol. 176: 1764-1766). These receptors 
share significant amino acid similarity and, most probably, also the mechanism of 
iron internalization, with receptors for siderophores and vitamin B12 of other Gram- 
negative bacteria (Cornelissen et al., 1993, J. Bacteriol. VIA: 5788-5797). In 
contrast, the mechanism by which Neisseriae utilize hemoglobin- and hemin-iron as 
well as the components involved have so far not been described. 

Recently, several proteins with hemoglobin-binding and/or hemin-binding 
activities have been identified in total membranes of iron-limited N. meningitidis and 
N. gonorrhoeae. 

Lee and Hill, 1992, J. gen. Microbiol. 138_: 2647^2656 disclose the specific 
hemoglobin binding by isolated outer membranes of N. meningitidis. 

Martek and Lee, 1994, Infect. Immun. 62: 700-703 disclosed that acquisition 
of heme iron by N. meningitidis does not involve meningococcal transferrin-binding 
proteins. 

Lee, 1994, Microbiol. 140: 1473-1480 describes the biochemical isolation and 
characterization of hemin binding proteins from N. meningitidis. 

The precise role of these proteins in hemin and/or hemoglobin utilization 
remains unclear at present, although these proteins are likely to be components of 
a hemin-utilization system in N. meningitidis. 

The dependence on host iron stores for Neisseria growth is a potentially 
useful route towards the development of novel and effective therapeutic intervention 
strategies. Historically, infections of both N. meningitidis and N. gonorrhoeae were 
treated chemoprophylactically with sulfonamide drugs. However, with the 
development of sulfonamide-resistant strains came the necessity of using alternative 
modes of therapy such as antibiotic treatment. More recently, the drug treatment of 
choice includes the administration of high grade penicillin. However, the success 
of antimicrobial treatment is decreased if therapy is not initiated early after infection. 
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Gonococcal infection has also been treated with penicillin, ampicillin, or 
amoxicillin, tetracycline hydrochloride, and spectinomycin. Unfortunately, because 
the incidence of infections due to penicillinase-producing bacteria has increased, 
several new, more expensive B-lactam antibiotics have been used in treatment. 
Despite the fact that existing antibiotics have decreased the serious consequences of 
gonorrhea, their use has not lowered the incidence of the infection in the general 
population. 

Prevention of meningococcal disease has been attempted by chemoprophylaxis 
and immunoprophylaxis. At present, rifampin and minocycline are used, but only 
for humans in close contact with an infected person as this treatment has a number 
of disadvantages. The only commercially available vaccine against meningococcal 
meningitis has as its major component the bacterial polysaccharide capsule. In adults 
this vaccine protects against serogroups A, C, Y and W135. It is not effective 
against serdgroup B, and is ineffective in children against serogroup C. Thus far, 
immunoprophylatic preventive treatment has not been available for N. gonorrhoeae. 
Thus, what is needed are better preventative therapies for meningococcal 

meningitis and gonorrhea including more effective, longer lasting vaccines which 

protect across all of the serogroups of N. meningitidis and all the serotypes of N. 

gonorrhoeae. In addition, better methods are need to treat meningococcal and 

gonococcal infection. 

SUMMARY OF THE INVENTION 

The present invention relates to the cloning, expression and functional 
characterization of genes encoding bacterial hemoglobin receptor proteins. 
Specifically, the invention relates to genes encoding hemoglobin receptor proteins 
from Neisseria species, in particular Neisseria meningitidis and N. gonorrhoeae. The 
invention comprises species of nucleic acids having a nucleotide sequence encoding 
novel bacterial hemoglobin receptor proteins. Also provided by this invention is the 
deduced amino acid sequence of the cognate hemoglobin receptor proteins of these 
bacterial genes. 

The invention provides nucleic acids, nucleic acid hybridization probes, 
recombinant expression constructs capable of expressing the hemoglobin receptor 
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protein of the invention in cultures of transformed cells, preferably bacterial cells, 
and such cultures of transformed bacterial cells that express the hemoglobin receptor 
proteins of the invention. The invention also provides gene knockout vectors for 
inactivating the hemoglobin receptor protein gene in cells, particularly cells of 
5 Neisseria species, via, for example, homologous recombination and other 

mechanisms, and cultures of such hemoglobin receptor protein null mutant cells. 

The invention also provides homogeneous preparations of the bacterial 
hemoglobin receptor proteins of the invention, as well as antibodies against and 
epitopes of the hemoglobin receptor protein. Methods for characterizing this 
10 receptor protein and methods for using the protein in the development of agents 

having pharmacological uses related to this receptor, particularly bactericidal and 
bacteriostatic uses, are also provided by the invention. 

In other embodiments of this invention are provided diagnostic methods and 
reagents encompassing the use of the anti-Neisseria hemoglobin receptor protein 
15 antibodies of the invention. Still further embodiments provided herein include 

therapeutic methods and reagents encompassing the use of the anti-Neisseria 
hemoglobin receptor protein antibodies of the invention. Even more embodiments 
include diagnostic methods and reagents encompassing the use of the Neisseria 
hemoglobin receptor protein-encoding nucleic acids of the invention, as sensitive 
20 probes for the presence of Neisseria infection using nucleic acid hybridization 

techniques and/or in vitro amplification methodologies. Yet additional embodiments 
of the invention include therapeutic methods and reagents encompassing the use of 
the Neisseria hemoglobin receptor protein-encoding nucleic acids of the invention, 
comprising recombinant expression constructs engineered to produce antisense 
25 transcripts of the Neisseria hemoglobin receptor gene and fragments thereof, as well 

as recombinant knockout vectors of the invention. The invention also provides the 
Neisseria hemoglobin receptor protein and epitopes thereof as components of 
vaccines for the development of non-disease associated immunity to pathological 
infection with bacteria of Neisseria species. 
30 In a first aspect, the invention provides a nucleic acid having a nucleotide 

sequence encoding a bacterial hemoglobin receptor protein gene. In a preferred 
embodiment, the bacterial hemoglobin receptor protein gene is isolated from bacteria 
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of Neisseria species. In a particularly preferred embodiment, the hemoglobin 
receptor protein gene is isolated from Neisseria meningitidis, serotype C. In a 
particular example of this embodiment, the nucleic acid comprises a 3.3 kilobase (kb) 
BamHI/Hindlll fragment of N. meningitidis genomic DNA. In this embodiment, the 
nucleotide sequence comprises an open reading frame of 2376 nucleotides of N. 
meningitidis genomic DNA encoding 792 amino acids comprising the hemoglobin 
receptor gene. In this embodiment of the invention, the nucleotide sequence of the 
N meningitidis hemoglobin receptor gene is the sequence depicted in Figure 2 (SEQ 
ID No: 1). It will be understood that the N. meningitidis gene as disclosed herein is 
defined, insofar as is necessary, by the amino acid sequence of the protein encoded 
therein, said amino acid sequence being represented in Figure 2 (SEQ. ID No.:2). 
Thus, it will be understood that the particular nucleotide sequence depicted in Figure 
2 (SEQ. ID. No.:l) is but one of a number of equivalent nucleotide sequences that 
encode the hemoglobin receptor protein, due to the degeneracy of the genetic code, 
and that all such alternative, equivalent nucleotide sequences are hereby explicitly 
encompassed within the disclosed nucleotide sequences of the invention. Also 
included herein are any mutant or allelic variations of this nucleotide sequence, either 
naturally occurring or the product of in vitro chemical or genetic modification. Each 
such variant will be understood to have essentially the same nucleotide sequence as 
the nucleotide sequence of the corresponding N. meningitidis hemoglobin receptor 
protein disclosed herein. 

In another particularly preferred embodiment of this aspect of the invention, 
the hemoglobin receptor protein gene is isolated from Neisseria meningitidis, 
serotype A. In a particular example of this embodiment, the nucleic acid comprises 
a 2373 basepair (bp) polymerase chain reaction-amplified fragment of N. 
meningitidis, serotype A genomic DNA. In this embodiment, the nucleotide 
sequence comprises an open reading frame of 2373 nucleotides of N. meningitidis 
genomic DNA encoding 790 amino acids comprising the hemoglobin receptor gene. 
In this embodiment of the invention, the nucleotide sequence of the N. meningitidis 
hemoglobin receptor gene is the sequence depicted in Figure 7 (SEQ ID No: 3). It 
will be understood that the N. meningitidis gene as disclosed herein is defined, 
insofar as is necessary, by the amino acid sequence of the protein encoded therein, 
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said amino acid sequence being represented in Figure 7 (SEQ. ID No.:4). Thus, it 
will be understood that the particular nucleotide sequence depicted in Figure 7 (SEQ. 
ID. No.: 3) is but one of a number of equivalent nucleotide sequences that encode the 
hemoglobin receptor protein, due to the degeneracy of the genetic code, and that all 
5 such alternative, equivalent nucleotide sequences are hereby explicitly encompassed 

within the disclosed nucleotide sequences of the invention. Also included herein are 
any mutant or allelic variations of this nucleotide sequence, either naturally occurring 
or the product of in vitro chemical or genetic modification. Each such variant will 
be understood to have essentially the same nucleotide sequence as the nucleotide 
10 sequence of the corresponding N. meningitidis hemoglobin receptor protein disclosed 

herein. 

In another particularly preferred embodiment of this aspect of the invention, 
the hemoglobin receptor protein gene is isolated from Neisseria meningitidis, 
serotype B. In a particular example of this embodiment, the nucleic acid comprises 

15 a 2376 basepair (bp) polymerase chain reaction-amplified fragment of AT. 

meningitidis, serotype A genomic DNA. In this embodiment, the nucleotide 
sequence comprises an open reading frame of 2373 nucleotides of N. meningitidis 
genomic DNA encoding 791 amino acids comprising the hemoglobin receptor gene. 
In this embodiment of the invention, the nucleotide sequence of the N. meningitidis 

20 hemoglobin receptor gene is the sequence depicted in Figure 8 (SEQ ID No: 5). It 
will be understood that the N. meningitidis gene as disclosed herein is defined, 
insofar as is necessary, by the amino acid sequence of the protein encoded therein, 
said amino acid sequence being represented in Figure 8 (SEQ. ID No.:6). Thus, it 
will be understood that the particular nucleotide sequence depicted in Figure 8 (SEQ. 

25 ID. No.:5) is but one of a number of equivalent nucleotide sequences that encode the 

hemoglobin receptor protein, due to the degeneracy of the genetic code, and that all 
such alternative, equivalent nucleotide sequences are hereby explicitly encompassed 
within the disclosed nucleotide sequences of the invention. Also included herein are 
any mutant or allelic variations of this nucleotide sequence, either naturally occurring 

30 or the product of in vitro chemical or genetic modification* Each such variant will 
be understood to have essentially the same nucleotide sequence as the nucleotide 
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sequence of the corresponding N. meningitidis hemoglobin receptor protein disclosed 
herein. 

In yet other preferred embodiments, the invention provides nucleic acid 
encoding a hemoglobin receptor protein gene isolated from Neisseria gonorrhoeae. 
In a particular example of this embodiment, the nucleic acid comprises a 2378 
basepair (bp) polymerase chain reaction-amplified fragment of N. gonorrhoeae 
genomic DNA. In this embodiment, the nucleotide sequence comprises an open 
reading frame of 2373 nucleotides of N. gonorrhoeae genomic DNA encoding 791 
amino acids comprising the hemoglobin receptor gene. In this embodiment of the 
invention, the nucleotide sequence of the N. gonorrhoeae hemoglobin receptor gene 
is the sequence depicted in Figure 9 (SEQ ID No:7). It will be understood that the 
N. gonorrhoeae gene as disclosed herein is defined, insofar as is necessary, by the 
amino acid sequence of the protein encoded therein, said amino acid sequence being 
represented in Figure 9 (SEQ. ID No.:8). Thus, it will be understood that the 
15 particular nucleotide sequence depicted in Figure 9 (SEQ. ID. No. :7) is but one of 

a number of equivalent nucleotide sequences that encode the hemoglobin receptor 
protein, due to the degeneracy of the genetic code, and that all such alternative, 
equivalent nucleotide sequences are hereby explicitly encompassed within the 
disclosed nucleotide sequences of the invention. Also included herein are any mutant 
20 or allelic variations of this nucleotide sequence, either naturally occurring or the 

product of in vitro chemical or genetic modification. Each such variant will be 
understood to have essentially the same nucleotide sequence as the nucleotide 
sequence of the corresponding N. gonorrhoeae hemoglobin receptor protein disclosed 
herein. 

25 The invention also provides bacterial hemoglobin receptor proteins. In a 

preferred embodiment, the bacterial hemoglobin receptor protein is isolated from 
bacteria of Neisseria species. In a particularly preferred embodiment, the 
hemoglobin receptor protein is isolated from Neisseria meningitidis. In a particular 
example of this embodiment, the protein is derived from N. meningitidis, serotype 

30 C and comprises an amino acid sequence of 792 amino acids. In this embodiment 
of the invention, the amino acid sequence of the N. meningitidis, serotype C 
hemoglobin receptor protein is the sequence depicted in Figure 2 (SEQ ID No:2). 
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In another example of this embodiment, the protein is derived from N. meningitidis, 
serotype A and comprises an amino acid sequence of 790 amino acids. In this 
embodiment of the invention, the amino acid sequence of the N. meningitidis, 
serotype A hemoglobin receptor protein is the sequence depicted in Figure 7 (SEQ 
5 ID No: 4). In yet another example of this embodiment, the protein is derived from 

N. meningitidis, serotype B and comprises an amino acid sequence of 791 amino 
acids. In this embodiment of the invention, the amino acid sequence of the N. 
meningitidis, serotype B hemoglobin receptor protein is the sequence depicted in 
Figure 8 (SEQ ID No: 6). The invention also provides hemoglobin receptor protein 

10 derived from TV. gonorrhoeae. In this embodiment of the invention, the protein 

comprises an amino acid sequence of 791 amino acids, and the amino acid sequence 
of the AT. gonorrhoeae hemoglobin receptor protein is the sequence depicted in 
Figure 9 (SEQ ID No: 8). Also explicitly encompassed within the scope of this 
invention are related bacterial hemoglobin receptor proteins, particularly such 

15 proteins isolated from Neisseria species, having essentially the same amino acid 

sequence and substantially the same biological properties as the hemoglobin receptor 
protein encoded by the N. meningitidis and N. gonorrhoeae nucleotide sequences 
described herein. 

In another aspect, the invention provides a homogeneous preparation of an 
20 approximately 85.5 kiloDalton (kD) bacterial hemoglobin receptor protein or 

derivative thereof, said size being understood to be the size of the protein before any 
post-translational modifications thereof. Also provided is a 90kD embodiment of the 
receptor as determined by sodium dodecyl sulfate/ polyacrylamide gel electrophoresis 
under reducing conditions. In a preferred embodiment, the bacterial hemoglobin 
25 receptor protein is isolated from bacteria of Neisseria species. In a particularly 

preferred embodiment, the hemoglobin receptor protein is isolated from Neisseria 
meningitidis. In one embodiment of this aspect of the invention, the protein is 
isolated from N. meningitidis, serotype C and the amino acid sequence of the 
bacterial hemoglobin receptor protein or derivative thereof preferably is the amino 
30 acid sequence of the hemoglobin receptor protein shown in Figure 2 (SEQ ID No:2). 

In a second embodiment of this aspect of the invention, the protein is isolated from 
N. meningitidis, serotype A and the amino acid sequence of the bacterial hemoglobin 
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receptor protein or derivative thereof preferably is the amino acid sequence of the 
hemoglobin receptor protein shown in Figure 7 (SEQ ID No:4). In a third 
embodiment of this aspect of the invention, the protein is isolated from N. 
meningitidis, serotype B and the amino acid sequence of the bacterial hemoglobin 
receptor protein or derivative thereof preferably is the amino acid sequence of the 
hemoglobin receptor protein shown in Figure 8 (SEQ ID No:6). The invention also 
provides a homogeneous preparation of a bacterial hemoglobin receptor protein 
isolated from N. gonorrhoeae. In a preferred embodiment, the amino acid sequence 
of the bacterial hemoglobin receptor protein or derivative thereof preferably is the 
amino acid sequence of the hemoglobin receptor protein shown in Figure 9 (SEQ ID 
No:8). 

This invention provides nucleotide probes derived from the nucleotide 
sequences herein provided. The invention includes probes isolated from either 
complementary DNA (cDNA) copies of bacterial messenger RNA (mRNA) or 
bacterial genomic DNA (gDNA), as well as probes made synthetically or by in vitro 
amplification methods using the sequence information provided herein. The 
invention specifically includes but is not limited to oligonucleotide, nick-translated, 
random primed, or in vitro amplified probes made using cDNA or genomic clones 
embodying the invention, and oligonucleotide and other synthetic probes synthesized 
chemically using the nucleotide sequence information of cDNA or genomic clone 
embodiments of the invention. 

It is a further object of this invention to provide such nucleic acid 
hybridization probes to detect the presence of bacteria of Neisseria species, 
particularly N. meningitidis and N. gonorrhoeae, in a biological sample in the 
diagnosis of a Neisseria infection in a human. Such a biological sample preferably 
includes blood, urine, semen, mucus, cerebrospinal fluid, peritoneal fluid and ascites 
fluids, as well as cell scrapings from the epithelium of the mouth, urethra, anus and 
rectum, and other organs. 

The present invention also includes peptides encoded by the nucleotide 
sequences comprising the nucleic acid embodiments of the invention. The invention 
includes either naturally occurring or synthetic peptides which may be used as 
antigens for the production of hemoglobin receptor protein-specific antibodies. The 
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invention also comprises such antibodies, preferably monoclonal antibodies, and cells 
and cultures of cells producing such antibodies. 

Thus, the invention also provides antibodies against and epitopes of bacterial 
hemoglobin receptor proteins of the invention. It is an object of the present 
5 invention to provide antibodies that are immunologically reactive to the bacterial 

hemoglobin receptor proteins of the invention. It is a particular object to provide 
monoclonal antibodies against these bacterial hemoglobin receptor proteins. In a 
preferred embodiment, antibodies provided are raised against bacterial hemoglobin 
receptor protein isolated from bacteria of Neisseria species. In a particularly 
10 preferred embodiment, such antibodies are specific for the hemoglobin receptor 

protein isolated from Neisseria meningitidis serotypes A, B or C. In additional 
particularly preferred embodiment, such antibodies are specific for the hemoglobin 
receptor protein isolated from Neisseria gonorrhoeae. 

Hybridoma cell lines producing such antibodies are also objects of the 
15 invention. It is envisioned at such hybridoma cell lines may be produced as the 

result of fusion between a non-immunoglobulin producing mouse myeloma cell line 
and spleen cells derived from a mouse immunized with purified hemoglobin receptor 
protein or a cell expressing antigens or epitopes of bacterial hemoglobin receptor 
proteins of the invention. The present invention also provides hybridoma cell lines 
20 that produce such antibodies, and can be injected into a living mouse to provide an 
ascites fluid from the mouse that is comprised of such antibodies. In a preferred 
embodiment, antibodies provided are raised against bacterial hemoglobin receptor 
protein isolated from bacteria of Neisseria species. In a particularly preferred 
embodiment, such antibodies are specific for the hemoglobin receptor protein isolated 
25 from Neisseria meningitidis, serotypes A, B or C. In additional particularly 

preferred embodiment, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria gonorrhoeae. 

It is a further object of the invention to provide immunologically-active 
epitopes of the bacterial hemoglobin receptor proteins of the invention. Chimeric 
30 antibodies immunologically reactive against the bacterial hemoglobin receptor 
proteins of the invention are also within the scope of this invention. In a preferred 
embodiment, antibodies and epitopes provided are raised against or derived from 
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bacterial hemoglobin receptor protein isolated from bacteria of Neisseria species. 
In a particularly preferred embodiment, such antibodies and epitopes are specific for 
the hemoglobin receptor protein isolated from Neisseria meningitidis, serotypes A, 
B or C. In additional particularly preferred embodiment, such antibodies and 
epitopes are specific for the hemoglobin receptor protein isolated from Neisseria 
gonorrhoeae. 

The present invention provides recombinant expression constructs comprising 
a nucleic acid encoding a bacterial hemoglobin receptor protein wherein the construct 
is capable of expressing the encoded hemoglobin receptor protein in cultures of cells 
transformed with the construct. Preferred embodiments of such constructs comprise 
the N. meningitidis, serotype C hemoglobin receptor gene depicted in Figure 2 (SEQ 
ID No..l), such constructs being capable of expressing the bacterial hemoglobin 
receptor protein encoded therein in cells transformed with the construct. Additional 
preferred embodiments of such constructs comprise the N. meningitidis, serotype A 
hemoglobin receptor gene depicted in Figure 7 (SEQ ID No.. 3), such constructs 
being capable of expressing the bacterial hemoglobin receptor protein encoded 
therein in cells transformed with the construct. Further additional preferred 
embodiments of such constructs comprise the N. meningitidis, serotype B hemoglobin 
receptor gene depicted in Figure 8 (SEQ ID No.:5), such constructs being capable 
of expressing the bacterial hemoglobin receptor protein encoded therein in cells 
transformed with the construct. The invention also provides recombinant expression 
constructs encoding a hemoglobin receptor protein gene isolted from ZN. 
gonorrhoeae. In a particularly preferred embodiment, such constructs comprise the 
N. gonorrhoeae hemoglobin receptor gene depicted in Figure 9 (SEQ ID No. :7), the 
constructs being capable of expressing the bacterial hemoglobin receptor protein 
encoded therein in cells transformed with the construct. 

The invention also provides cultures of cells, preferably bacterial cells, having 
been transformed with the recombinant expression constructs of the invention, each 
such cultures being capable of and in fact expressing the bacterial hemoglobin 
receptor protein encoded in the transforming construct. 

The present invention also includes within its scope protein preparations of 
prokaryotic cell membranes containing the bacterial hemoglobin receptor protein of 
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the invention, derived from cultures of prokaryotic cells transformed with the 
recombinant expression constructs of the invention. 

The invention also provides diagnostic reagents and methods for using such 
reagents for detecting the existence of an infection in a human, with bacteria of a 
Neisseria species. In preferred embodiments, such diagnostic reagents comprise 
antibodies that are immunologically reactive with a bacterial hemoglobin receptor 
protein. In a preferred embodiment, such antibodies are raised against a bacterial 
hemoglobin receptor protein isolated from bacteria of Neisseria species. In a 
particularly preferred embodiment, such antibodies are specific for the hemoglobin 
receptor protein isolated from Neisseria meningitidis, serotypes A, B or C In 
additional particularly preferred embodiments, such antibodies are specific for the 
hemoglobin receptor protein isolated from Neisseria gonorrhoeae. 

In yet another embodiment of this aspect of the invention are provided 
diagnostic reagents and methods for using such reagents wherein said reagents are 
nucleic acid hybridization probes comprising a bacterial hemoglobin receptor gene. 
In a preferred embodiment, the bacterial hemoglobin receptor protein gene is isolated 
from bacteria of Neisseria species. In a particularly preferred embodiment, the 
hemoglobin receptor protein gene is isolated from Neisseria meningitidis. In 
particular examples of this embodiment of the invention, the nucleic acid probes 
comprise a specifically-hybridizing fragment of a 3.3 kilobase (kb) BamHl/Hindlll 
fragment of N. meningitidis, serotype C genomic DNA. In this embodiment, the 
nucleotide sequence comprises all or a specifically-hybridizing fragment of an open 
reading frame of 2376 nucleotides of N. meningitidis, serotype C genomic DNA 
encoding 792 amino acids comprising the hemoglobin receptor gene. In this 
embodiment of the invention, the nucleotide sequence of the N. meningitidis, 
serotype C hemoglobin receptor gene is the sequence depicted in Figure 2 (SEQ ID 
No:l). In another example of this embodiment of the invention, the nucleic acid 
probes comprise a specifically-hybridizing fragment of a 2373bp, polymerase chain 
reaction-amplified fragment of N meningitidis, serotype A genomic DNA. In this 
embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 
fragment of an open reading frame of 2370 nucleotides of N. meningitidis, serotype 
A genomic DNA encoding 790 amino acids comprising the hemoglobin receptor 
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gene. In this embodiment of the invention, the nucleotide sequence of the N. 
meningitidis, serotype A hemoglobin receptor gene is the sequence depicted in Figure 
7 (SEQ ID No: 3). In yet another example of this embodiment of the invention, the 
nucleic acid probes comprise a specifically-hybridizing fragment of a 2376bp, 
polymerase chain reaction-amplified fragment of N. meningitidis, serotype B genomic 
DNA. In this embodiment, the nucleotide sequence comprises all or a specifically- 
hybridizing fragment of an open reading frame of 2373 nucleotides of N. 
meningitidis, serotype B genomic DNA encoding 791 amino acids comprising the 
hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the N. meningitidis, serotype B hemoglobin receptor gene is the 
sequence depicted in Figure 8 (SEQ ID No:5). The invention also provides nucleic 
acid hybridization probes comprising a bacterial hemoglobin receptor gene isolated 
from N. gonorrhoeae. In a preferred embodiment of this aspect of the invention, the 
nucleic acid probes comprise a specifically-hybridizing fragment of a 2378bp, 
polymerase chain reaction-amplified fragment ofN. gonorrhoeae genomic DNA. In 
this embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 
fragment of an open reading frame of 2373 nucleotides of N. gonorrhoeae genomic 
DNA encoding 791 amino acids comprising the hemoglobin receptor gene. In this 
embodiment of the invention, the nucleotide sequence of the N. gonorrhoeae 
hemoglobin receptor gene is the sequence depicted in Figure 9 (SEQ ID No: 7). It 
will be understood that the term "specifically-hybridizing" when used to describe a 
fragment of a nucleic acid encoding a bacterial hemoglobin receptor gene is intended 
to mean that nucleic acid hybridization of such a fragment is stable under high 
stringency conditions of hybridization and washing as the term "high stringency" 
would be understood by those having skill in the molecular biological arts. 

Also provided by the invention are therapeutic agents and methods for using 
such agents for treating the an infection in a human, with bacteria of a Neisseria 
species. In preferred embodiments, such agents comprise antibodies that are 
immunologically reactive with a bacterial hemoglobin receptor protein. In a 
preferred embodiment, such antibodies are raised against a bacterial hemoglobin 
receptor protein isolated from bacteria of Neisseria species. In a particularly 
preferred embodiment, such antibodies are specific for the hemoglobin receptor 
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protein isolated from Neisseria meningitidis, serotypes A, B or C. In additional 
preferred embodiments, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria gonorrhoeae. Therapeutic agents provided in this 
aspect of the invention comprise such antibodies in a phannaceutically-acceptable 
carrier, along with appropriate adjuvants and the like. In additional embodiments, 
such antibodies are covalently conjugated to a bactericidal or bacteriostatic agent 
effective against bacteria of Neisseria species, preferably N meningitidis and N. 
gonorrhoeae. 

In yet another embodiment of this aspect of the invention are provided 
therapeutic reagents and methods for using such reagents wherein said reagents 
comprise recombinant expression constructs of the invention, or a homologue thereof 
that expresses the nucleic acid encoding a hemoglobin receptor in an antisense 
orientation. In a preferred embodiment, the bacterial hemoglobin receptor protein 
gene is isolated from bacteria of Neisseria species. In a particularly preferred 
embodiment, the hemoglobin receptor protein gene is isolated from Neisseria 
meningitidis. In particular examples of this embodiment of the invention, the nucleic 
acids comprise a specifically-hybridizing fragment of a 3.3 kilobase (kb) 
BamKl/HindJII fragment of N, meningitidis, serotype C genomic DNA. In this 
embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 
fragment of an open reading frame of 2376 nucleotides of N. meningitidis, serotype 
C genomic DNA encoding 792 amino acids comprising the hemoglobin receptor 
gene. In this embodiment of the invention, the nucleotide sequence of the N. 
meningitidis, serotype C hemoglobin receptor gene is the sequence depicted in Figure 
2 (SEQ ID No:l). In another example of this embodiment of the invention, the 
nucleic acid probes comprise a specifically-hybridizing fragment of a 2373bp, 
polymerase chain reaction-amplified fragment of N. meningitidis, serotype A 
genomic DNA. In this embodiment, the nucleotide sequence comprises all or a 
specifically-hybridizing fragment of an open reading frame of 2370 nucleotides of 
N. meningitidis, serotype A genomic DNA encoding 790 amino acids comprising the 
hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the N. meningitidis, serotype A hemoglobin receptor gene is the 
sequence depicted in Figure 7 (SEQ ID No: 3). In yet another example of this 
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embodiment of the invention, the nucleic acid probes comprise a specifically- 
hybridizing fragment of a 2376bp, polymerase chain reaction-amplified fragment of 
N. meningitidis, serotype B genomic DNA. In this embodiment, the nucleotide 
sequence comprises all or a specifically-hybridizing fragment of an open reading 
5 frame of 2373 nucleotides of N. meningitidis, serotype B genomic DNA encoding 

791 amino acids comprising the hemoglobin receptor gene. In this embodiment of 
the invention, the nucleotide sequence of the N. meningitidis, serotype B hemoglobin 
receptor gene is the sequence depicted in Figure 8 (SEQ ID No:5). The invention 
also provides recombinant expression constructs of the invention, or a homologue 

10 thereof that expresses the nucleic acid encoding a hemoglobin receptor in an 

antisense orientation, wherein the nucleic acid encodes a bacterial hemoglobin 
receptor gene isolated from M gonorrhoeae. In a preferred embodiment of this 
aspect of the invention, the nucleic acid probes comprise a specifically-hybridizing 
fragment of a 2378bp, polymerase chain reaction-amplified fragment of N. 

15 gonorrhoeae genomic DNA. In this embodiment, the nucleotide sequence comprises 

all or a specifically-hybridizing fragment of an open reading frame of 2373 
nucleotides of N. gonorrhoeae genomic DNA encoding 791 amino acids comprising 
the hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the N. gonorrhoeae hemoglobin receptor gene is the sequence depicted 

20 in Figure 9 (SEQ ID No:7). 

The invention also provides a method for screening compounds for their 
ability to inhibit, facilitate or modulate the biochemical activity of a bacterial 
hemoglobin receptor protein of the invention, for use in the in vitro screening of 
novel agonist and antagonist compounds and novel bactericidal and bacteriostatic 

25 agents specific for the hemoglobin receptor protein. In preferred embodiments, cells 

transformed with a recombinant expression construct of the invention are contacted 
with such a compound, and the binding capacity of the compounds, as well as the 
effect of the compound on binding of other, known hemoglobin receptor agonists 
such as hemoglobin and hemin, and antagonists, is assayed. Additional preferred 

30 embodiments comprise quantitative analyses of such effects. 

The present invention is also useful for the detection of bactericidal and/or 
bacteriostatic analogues, agonists or antagonists, known or unknown, of a bacterial 
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hemoglobin receptor protein, preferably derived from bacteria of Neisseria species, 
most preferably isolated from N. meningitidis, wherein such compounds are either 
naturally occurring or embodied as a drug. 

The invention also provides vaccines for immunizing a human against 
5 infection with pathogenic bacteria of Neisseria species, the vaccines comprising the 

hemoglobin binding proteins of the invention or antigenic fragments thereof. In a 
preferred embodiment, the vaccines of the invention comprise cells expressing a 
hemoglobin receptor binding protein of the invention, or an antigenic fragment 
thereof, preferably wherein said cells are attenuated varieties of cells adapted for 

10 growth in humans, i.e., wherein such cells are non-pathogenic and do not cause 

bactermia, endotoxemia or sepsis. Examples of such attenuated varieties of cells 
include attenuated strains of Salmonella species, for example Salmonella typhi and 
Salmonella typhimurium, as well as other attenuated bacterial species. Also provided 
by the invention are recombinant expression constructs as disclosed herein useful per 

15 se as vaccines, for introduction into an animal and production of an immunologic 

response to bacterial hemoglobin receptor protein antigens encoded therein. 

Specific preferred embodiments of the present invention will become evident 
from the following more detailed description of certain preferred embodiments and 
the claims. 

20 

DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects of the present invention, the various features 
thereof, as well as the invention itself may be more fully understood from the 
following description, when read together with the accompanying drawings in which: 
25 Figure 1 is a schematic drawing of the restriction enzyme digestion map of 

a N. meningitidis cosmid clone and subclones thereof derived as described in 
Example 2. 

Figure 2 illustrates the nucleotide (SEQ ID No.:l) and deduced amino acid 
(SEQ ID No,:2) sequences of the N. meningitidis hemoglobin receptor protein 
30 encoded in a 3.3 kb BamHI/HindUl DNA fragment. 

Figure 3 presents a photograph of a stained SDS/ 10% PAGE electrophoresis 
gel showing the results of in vitro expression of the N. meningitidis hemoglobin 
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receptor gene product as an approximately 90 kilodalton protein, and 0-lactamase 
protein having a molecular weight of about 30.0 kilodaltons used as a molecular 
weight marker. 

Figure 4 presents an amino acid sequence comparison between portions of the 
N. meningitidis transferrin receptor Tbpl (SEQ ID No.:9), the N. meningitidis 
lactoferrin receptor LbpA (SEQ ID No. .10), and N. meningitidis hemoglobin 
receptor HmbR (SEQ ID No.:2). 

Figure 5 illustrates Southern hybridization analysis of chromosomal DNA 
from N. meningitidis 8013 and the MC8013/im£>tf mutant using a BamHI-Sall 
fragment of the hmb gene as probe labeled using a DIG nonradioactive DNA 
labelling and detection kit (Boehringer Mannheim Biochemicals, Indianapolis, IN). 
Lane 1 contains DNA from N. meningitidis strain MC8013, digested with Ctal; lane 
2 is MC&03lhmbR DNA digested with Clal, lane 3, is MC8013 DNA digested with 
BamHl and Sail; and lane 4 is MCS0l3hmbR DNA digested with BamUl and Sail. 

Figure 6 is a graph describing the course of infection using N. meningitidis 
wild type (MC8013) and hmbR mutant strains in an in vivo rat infant infection 
model. Each strain was injected intraperitoneally (2 x 10 6 CFU) into three infant 
inbred Lewis rats. The results represent the average of two similarly-performed 
experiments. 

Figure 7 illustrates the nucleotide (SEQ ID No.:3) and deduced amino acid 
(SEQ ID No.: 4) sequences of the N. meningitidis, serotype A hemoglobin receptor 
protein encoded on a 2373bp polymerase chain reaction-amplified DNA fragment. 

Figure 8 illustrates the nucleotide (SEQ ID No.: 5) and deduced amino acid 
(SEQ ID No.:6) sequences of the N. meningitidis, serotype B hemoglobin receptor 
protein encoded on a 2376bp polymerase chain reaction-amplified DNA fragment. 

Figure 9 illustrates the nucleotide (SEQ ID No.: 7) and deduced amino acid 
(SEQ ID No.: 8) sequences of the N. gonorrhoeae hemoglobin receptor protein 
encoded on a 2376bp polymerase chain reaction-amplified DNA fragment. 

Figure 10 represents a schematic of a nucleic acid sequence comparison 
between the hemoglobin receptor proteins derived from N. meningitidis, serotypes 
A (SEQ ID No.:3), B (SEQ ID No.:5) and C (SEQ ID No.:l) and from N. 
gonorrhoeae (SEQ ID No.:7), wherein the direction of trascription of the genes is 
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in the direction of the arrow, and the following abbreviations refer to restriction 
endonuclease sites: H represents HindlU; N represents Noil; Bg represents Bgll; Bs 
represents BssHl; Nr represents Nrul; CI represents Clal; P represents Pstl; Sa 
represents Sacl; Av represents Aval; B represents BamHl; S represents Sail; EV 
represents EcoRV; Sh represents Sphl; and Sy represents Styl. 

Figure 11 presents an amino acid sequence comparison between the 
hemoglobin receptor proteins derived from N meningitidis, serotypes A (SEQ ID 
No.:4), B (SEQ ID No.:6) and C (SEQ ID No.:2) and from N. gonorrhoeae (SEQ 
ID No.: 8). 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The term "bacterial hemoglobin receptor" as used herein refers to bacterial 
proteins comprising the outer membrane of Gram negative bacteria, which 
specifically mediate transit of hemoglobin-derived hemin, as well as hemin from 
other sources, through the outer membrane of such bacteria and into the periplasmic 
space. The bacterial hemoglobin receptor proteins of the invention are characterized 
by, first, an amino acid sequence that is essentially the sequence depicted in Figures 
2 (SEQ ID No.:2), 7 (SEQ ID No.:4), 8 (SEQ ID No.:6) and 9 (SEQ ID No.: 8). 
The bacterial hemoglobin receptor proteins of the invention are further characterized 
by having substantially the same biological activity as a protein having the amino 
acid sequence depicted in Figures 2 (SEQ ID No.:2), 7 (SEQ ID No.:4), 8 (SEQ ID 
No.: 6) and 9 (SEQ ID No.: 8). This definition is intended to encompass naturally- 
occurring variants and mutant proteins, as well as genetically engineered variants 
made by man. 

Cloned, isolated and purified nucleic acid provided by the present invention 
may encode a bacterial hemoglobin receptor protein of any Neisseria species of 
origin, including, most preferably. Neisseria meningitidis species and serotypes 
thereof and Neisseria gonorhoeae species. 

The nucleic acid hybridization probes provided by the invention comprise 
DNA or RNA having all or a specifically-hybridizing fragment of the nucleotide 
sequence of the hemoglobin receptor protein as depicted in Figures 2 (SEQ ID 
No.:l), 7 (SEQ ID No.:3), 8 (SEQ ID No.:5) and 9 (SEQ ID No.:7), or any portion 
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thereof effective in nucleic acid hybridization. Mixtures of such nucleic acid 
hybridization probes are also within the scope of this embodiment of the invention. 
Nucleic acid probes as provided herein are useful for detecting the presence of a 
bacteria, inter alia, in a human as the result of an infection, in contaminated 
biological samples and specimens, in foodstuffs and water supplies, or in any 
substance that may come in to contact with the human. Specific hybridization will 
be understood to mean that the nucleic acid probes of the invention are capable of 
forming stable, specific hybridization to bacterially-derived DNA or RNA under 
conditions of high stringency, as the term "high stringency" would be understood by 
those with skill in the art {see, for example, Sambrook et al. , 1989, Molecular 
Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. and Hames and Higgins, eds., 1985, Nucleic Acid Hybridization IRL 
Press, Oxford, U.K.). Hybridization will be understood to be accomplished using 
well-established techniques, including but not limited to Southern blot hybridization. 
Northern blot hybridization, in situ hybridization and Southern hybridization to 
polymerase chain reaction product DNAs. The invention will thus be understood to 
provide oligonucleotides, specifically, pairs of oligonucleotides, for use as primers 
in support of in vitro amplification of bacterial hemoglobin receptor genes and 
mRNA transcripts. 

The production of proteins such as bacterial hemoglobin receptor proteins 
from cloned genes by genetic engineering means is well known in this art. The 
discussion which follows is accordingly intended as an overview of this field, and is 
not intended to reflect the full state of the art. It will be understood from the 
following discussion that the hemoglobin receptor protein genes of this invention are 
particularly advantageous, since expression of such proteins by bacteria, including 
non-Neisseria species of bacteria, can complement certain auxotrophic mutants of 
said transformed bacteria otherwise unable to subsist absent supplementation of the 
growth media with iron (HI). 

DNA encoding a bacterial hemoglobin receptor protein, in view of the instant 
disclosure, by chemical synthesis, by screening reverse transcripts of mRNA from 
appropriate cells, by screening genomic libraries from appropriate cells, or by 
combinations of these procedures, as illustrated below. Screening of mRNA or 
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genomic DNA may be carried out with oligonucleotide probes generated from the 
nucleic acid sequence information from the bacterial hemoglobin receptor protein 
disclosed herein. Probes may be labeled with a detectable group such as a 
fluorescent group, a radioactive atom or a chemiluminescent group in accordance 
with know procedures and used in conventional hybridization assays, as described 
in greater detail in the Examples below. In the alternative, bacterial hemoglobin 
receptor protein-encoding nucleic acids may be obtained by use of the polymerase 
chain reaction (PCR) procedure, using appropriate pairs of PCR oligonucleotide 
primers corresponding to nucleic acid sequence information derived from a bacterial 
hemoglobin receptor protein as provided herein. See U.S. Patent Nos. 4,683,195 
to Mullis et al. and 4,683,202 to Mullis, as specifically disclosed herein in Example 
9 below. In another alternative, such bacterial hemoglobin receptor protein-encoding 
nucleic acids may be isolated from auxotrophic cells transformed with a bacterial 
hemoglobin receptor protein gene, thereby relieved of the nutritional requirement for 
uncomplexed iron (III). 

Any bacterial hemoglobin receptor protein of the invention may be 
synthesized in host cells transformed with a recombinant expression construct 
comprising a nucleic acid encoding the bacterial hemoglobin receptor protein. Such 
recombinant expression constructs can also be comprised of a vector that is a 
replicable DNA construct. Vectors are used herein either to amplify DNA encoding 
a bacterial hemoglobin receptor protein and/or to express DNA encoding a bacterial 
hemoglobin receptor protein. For the purposes of this invention, a recombinant 
expression construct is a replicable DNA construct in which a nucleic acid encoding 
a bacterial hemoglobin receptor protein is operably linked to suitable control 
sequences capable of effecting the expression of the bacterial hemoglobin receptor 
protein in a suitable host cell. 

The need for such control sequences will vary depending upon the host cell 
selected and the transformation method chosen. Generally, bacterial control 
sequences include a transcriptional promoter, an optional operator sequence to 
control transcription, a sequence encoding suitable mRNA ribosomal binding sites 
(the Shine-Delgarno sequence), and sequences which control the termination of 
transcription and translation. Amplification vectors do not require expression control 
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domains. All that is needed is the ability to replicate in a host, usually conferred by 
an origin of replication, and a selection gene to facilitate recognition of 
transformants. See, Sambrook et al., 1989, ibid. 

Vectors useful for practicing the present invention include plasmids and virus- 
derived constructs, including phage and particularly bacteriophage, and integratable 
DNA fragments (i.e., fragments integratable into the host genome by homologous 
recombination). The vector replicates and functions independently of the host 
genome, or may, in some instances, integrate into the genome itself. Suitable 
vectors will contain replicon and control sequences which are derived from species 
compatible with the intended expression host. A preferred vector is pLAFR2 (see 
Riboli et al., 1991, Microb. Pathogen. 10: 393-403). 

Transformed host cells are cells which have been transformed or transfected 
with recombinant expression constructs made using recombinant DNA techniques and 
comprising nucleic acid encoding a bacterial hemoglobin receptor protein. Preferred 
host cells are cells of Neisseria species, particularly N. meningitidis, as well as 
Salmonella typhi and Salmonella typhimurium species, and Escherichia coli 
auxotrophic mutant cells (hemA aroB). Transformed host cells may express the 
bacterial hemoglobin receptor protein, but host cells transformed for purposes of 
cloning or amplifying nucleic acid hybridization probe DNA need not express the 
receptor protein. When expressed, the bacterial hemoglobin receptor protein of the 
invention will typically be located in the host cell outer membrane. See, Sambrook 
et al., ibid. 

Cultures of bacterial cells, particularly cells of Neisseria species, and certain 
E. coli mutants, are a desirable host for recombinant bacterial hemoglobin receptor 
protein synthesis. In principal, any bacterial cell auxotrophic for uncomplexed iron 
ail) is useful for selectively growing bacterial hemoglobin receptor protein- 
transformed cells. However, for this purpose, well-characterized auxotrophs, such 
as E. coli hemA aroB mutants are preferred. 

The invention provides homogeneous compositions of a bacterial hemoglobin 
receptor protein produced by transformed cells as provided herein. Each such 
homogeneous composition is intended to be comprised of a bacterial hemoglobin 
receptor protein that comprises at least 90% of the protein in such a homogenous 
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composition. The invention also provides membrane preparations from cells 
expressing a bacterial hemoglobin receptor protein as the result of transformation 
with a recombinant expression construct of the invention, as described herein. 

Bacterial hemoglobin receptor proteins, peptide fragments thereof and 
5 membranes derived from cells expressing such proteins in accordance with the 

present invention may be used for the production of vaccines effective against 
bacterial infections in a human, with pathogenic microorganisms expressing such 
bacterial hemoglobin receptor proteins. Such vaccines preferably would be effective 
in raising an immunological response against bacteria of Neisseria species, most 
10 preferably N. meningitidis and M gonorhoeae. Also encompassed within the 

vaccines provided by the invention are recombinant expression constructs as 
disclosed herein useful per se as vaccines, for introduction into an animal and 
production of an immunologic response to bacterial hemoglobin receptor protein 
antigens encoded therein. 
15 Preparation of vaccines which contain polypeptide or polynucleotide 

sequences as active ingredients is well understood in the art. Typically, such 
vaccines are prepared as injectables, either as liquid solutions or suspensions. 
However, solid forms suitable for solution in, or suspension in, liquid prior to 
injection may also be prepared. The preparation may also be emulsified. The active 
20 immunogenic ingredient is often mixed with excipients which are pharmaceutical^ 

acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations 
thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants 
25 which enhance the effectiveness of the vaccine. The vaccines are conventionally 

administered parenterally, by injection, for example, either subcutaneously or 
intramuscularly. Additional formulations which are suitable for other modes of 
administration include suppositories and, in some cases, oral formulations. For 
suppositories, traditional binders and carriers may include, for example, polyalkalene 
30 glycols or triglycerides; such suppositories may be formed from mixtures containing 
the active ingredient in the range of 0.5% to 10%, preferably 1 to 2%. Oral 
formulations include such normally employed excipients as, for example, 
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pharmaceutical grades of manitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate and the like. These compositions take 
the form of solutions, suspensions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10% to 95% of active ingredient, preferably 25 
to 70%. 

The polypeptides of the invention may. be formulated into the vaccine as 
neutral or salt forms. Pharmaceutically acceptable salts, include the acid additional 
salts (formed with the free amino groups of the peptide) and which are formed with 
inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with 
the free carboxyl groups may also be derived from inorganic bases such as, for 
example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such 
organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 
procaine, and the like. 

In another embodiment, such vaccines are provided wherein the bacterial 
hemoglobin receptor proteins or peptide fragments thereof are present in the intact 
cell membranes of cells expressing such proteins in accordance with the present 
invention. In preferred embodiments, cells useful in these embodiments include 
attenuated varieties of cells adapted to growth in humans. Most preferably, said cells 
are attenuated varieties of cells adapted for growth in humans, i.e.., wherein such 
cells do not cause frank disease or other pathological conditions, such as bactermia, 
endotoxemia or sepsis. For the purposes of this invention, "attenuated" cells will be 
understood to encompass prokaryotic and eukaryotic cells that do not cause infection, 
disease, septicemia, endotoxic shock, pyrogenic shock, or other serious and adverse 
reactions to administration of vaccines to an animal, most preferably a human, when 
such cells are introduced into the animal, whether such cells are viable, living, heat-, 
chemically- or genetically attenuated or inactivated, or dead. It will be appreciated 
by those with skill in this art that certain minor side-effects of vaccination, such as 
short-term fever, muscle discomfort, general malaise, and other well-known reactions 
to vaccination using a variety of different types of vaccines, can be anticipated as 
accompanying vaccination of an animal, preferably a human, using the vaccines of 
the invention. Such acute, short-term and non-life-threatening side effects are 
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encompassed in the instant definition of the vaccines of the invention, and vaccines 
causing such side-effects fall within the definition of "attenuated" presented herein. 
Preferred examples of such attenuated cells include attenutated varieties of 
Salmonella species, preferably Salmonella typhi and Salmonella typhimurium, as well 
as other attenuated bacterial species. It will be specifically understood that these 
embodiments of the vaccines of the invention encompass so-called "live" attenuated 
cell preparations as well as heat- or chemically- inactivated cell preparations. 

In other embodiments of the invention are provided vaccines that are DNA 
vaccines, comprising the nucleic acids of the invention in recombinant expression 
constructs competant to direct expression of hemoglobin receptor proteins when 
introduced into an animal. In preferred embodiments, such DNA vaccines comprise 
recombinant expression constructs wherein the hemoglobin receptor-encoding nucleic 
acids of the invention are operably linked to promoter elements, most preferably the 
early gene promoter of cytomegalovirus or the early gene promoter of simian virus 
40. DNA vaccines of the invention are preferably administered by intramuscular 
injection, but any appropriate route of administration, including oral, transdermal, 
rectal, nasal, aerosol administration into lung, or any other clinically-acceptable route 
of administration can be used by those with skill in the art. 

In general, the vaccines of the invention are administered in a manner 
compatible with the dosage formulation, and in such amount as will be 
therapeutically effective and immunogenic. The quantity to be administered depends 
on the subject to be treated, capacity of the subject's immune system to synthesize 
antibodies, and the degree of protection desired. Precise amounts of active 
ingredient required to be administered depend on the judgment of the practitioner and 
are peculiar to each individual. However, suitable dosage ranges are of the order 
of several hundred micrograms active ingredient per individual. Suitable regimes for 
initial administration and booster shots are also variable, but are typified by an initial 
administration followed in one or two week intervals by a subsequent injection or 
other administration. 

The recombinant expression constructs of the present invention are also useful 
in molecular biology to transform bacterial cells which do not ordinarily express a 
hemoglobin receptor protein to thereafter express this receptor. Such cells are 
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useful, inter alia, as intermediates for making cell membrane preparations useful for 
receptor binding activity assays, vaccine production, and the like, and in certain 
embodiments may themselves be used, inter alia, as vaccines or components of 
vaccines, as described above. The recombinant expression constructs of the present 
invention thus provide a method for screening potentially useful bactericidal and 
bacteriostatic drugs at advantageously lower cost than conventional screening 
protocols. While not completely eliminating the need for ultimate in vivo activity 
and toxicology assays, the constructs and cultures of the invention provide an 
important first screening step for the vast number of potentially useful bactericidal 
and bacteriostatic drugs synthesized, discovered or extracted from natural sources 
each year. In addition, such bactericidal or bacteriostatic drugs would be selected 
to utilize a nutritional pathway associated with infectious virulence in these types of 
bacteria, as disclosed in more detail below, thus selectively targeting bacteria 
associated with the development of serious infections in vivo. 

Also, the invention provides both functional bacterial hemoglobin receptor 
proteins, membranes comprising such proteins, cells expressing such proteins, and 
the amino acid sequences of such proteins. This invention thereby provides sufficient 
structural and functional activity information to enable rational drug design of novel 
therapeutically-active antibacterial drugs using currently-available techniques (see 
Walters, "Computer-Assisted Modeling of Drugs", in Klegerman & Groves, eds., 
1993 > Pharmaceutical Biotechnnlnpy Interpharm Press: Buffalo Grove, IL, pp. 165- 
174). 

Nucleic acids and oligonucleotides of the present invention are useful as 
diagnostic tools for detecting the existence of a bacterial infection in a human, caused 
by a hemoglobin receptor protein-expressing pathological organism of Neisseria 
species. Such diagnostic reagents comprise nucleic acid hybridization probes of the 
invention and encompass paired oligonucleotide PCR primers, as described above. 
Methods provided by the invention include blot hybridization, in situ hybridization 
and in vitro amplification techniques for detecting the presence of pathogenic bacteria 
in a biological sample. Appropriate biological samples advantageously screened 
using the methods described herein include plasma, serum, lymph, cerebrospinal 
fluid, seminal fluid, mucosal tissue samples, biopsy samples, and other potential sites 
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of bacterial infection. It is also envisioned that the methods of the invention may be 
used to screen water, foodstuffs, pharmaceuticals, and other potential sources of 
infection. 

The invention also provides antibodies that are immunologically reactive to 
a bacterial hemoglobin receptor protein or epitopes thereof provided by the 
invention. The antibodies provided by the invention may be raised, using methods 
well known in the art, in animals by inoculation with cells that express a bacterial 
hemoglobin receptor protein or epitopes thereof, cell membranes from such cells, 
whether crude membrane preparations or membranes purified using methods well 
known in the art, or purified preparations of proteins, including fusion proteins, 
particularly fusion proteins comprising epitopes of a bacterial hemoglobin receptor 
protein of the invention fused to heterologous proteins and expressed using genetic 
engineering means in bacterial, yeast or eukaryotic cells, said proteins being isolated 
from such cells to varying degrees of homogeneity using conventional biochemical 
means. Synthetic peptides made using established synthetic means in vitro and 
optionally conjugated with heterologous sequences of amino acids, are also 
encompassed in these methods to produce the antibodies of the invention. Animals 
that are used for such inoculations include individuals from species comprising cows, 
sheep, pigs, mice, rats, rabbits, hamsters, goats and primates. Preferred animals for 
inoculation are rodents (including mice, rats, hamsters) and rabbits. The most 
preferred animal is the mouse. 

Cells that can be used for such inoculations, or for any of the other means 
used in the invention, include any cell that naturally expresses a bacterial hemoglobin 
receptor protein as provided by the invention, or any cell or cell line that expresses 
a bacterial hemoglobin receptor protein of the invention, or any epitope thereof, as 
a result of molecular or genetic engineering, or that has been treated to increase the 
expression of an endogenous or heterologous bacterial hemoglobin receptor protein 
by physical, biochemical or genetic means. Preferred cells are E. coli auxotrophic 
mutant hemA aroB cells transformed with a recombinant expression construct of the 
invention and grown in media supplemented with hemin or hemoglobin as the sole 
iron (HI) source, and cells of Neisseria species. 
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The present invention also provides monoclonal antibodies that are 
immunologically reactive with an epitope of a bacterial hemoglobin receptor protein 
of the invention, or fragment thereof, present on the surface of such cells, preferably 
E. coli cells. Such antibodies are made using methods and techniques well known 
to those of skill in the art. Monoclonal antibodies provided by the present invention 
are produced by hybridoma cell lines, that are also provided by the invention and 
that are made by methods well known in the art (see Harlow and Lane, 1988, 
Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y.). 

Hybridoma cell lines are made by fusing individual cells of a myeloma cell 
line with spleen cells derived from animals immunized with a homogeneous 
preparation of a bacterial hemoglobin receptor protein, membranes comprised 
thereof, cells expressing such protein, or epitopes of a bacterial hemoglobin receptor 
protein, used per se or comprising a heterologous or fusion protein construct, as 
described above. The myeloma cell lines used in the invention include lines derived 
from myelomas of mice, rats, hamsters, primates and humans. Preferred myeloma 
cell lines are from mouse, and the most preferred mouse myeloma cell line is 
P3X63-Ag8.653. The animals from whom spleens are obtained after immunization 
are rats, mice and hamsters, preferably mice, most preferably Balb/c mice. Spleen 
cells and myeloma cells are fused using a number of methods well known in the art, 
including but not limited to incubation with inactivated Sendai virus and incubation 
in the presence of polyethylene glycol (PEG). The most preferred method for cell 
fusion is incubation in the presence of a solution of 45% (w/v) PEG-1450. 
Monoclonal antibodies produced by hybridoma cell lines can be harvested from cell 
culture supernatant fluids from in vitro cell growth; alternatively, hybridoma cells 
can be injected subcutaneously and/or into the peritoneal cavity of an animal, most 
preferably a mouse, and the monoclonal antibodies obtained from blood and/or 
ascites fluid. 

Monoclonal antibodies provided by the present invention are also produced 
by recombinant genetic methods well known to those of skill in the art, and the 
present invention encompasses antibodies made by such methods that are 
immunologically reactive with an epitope of a bacterial hemoglobin receptor protein 
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of the invention. The present invention also encompasses fragments, including but 
not limited to F(ab) and F(ab) 2 fragments, of such antibody. Fragments are 
produced by any number of methods, including but not limited to proteolytic 
cleavage, chemical synthesis or preparation of such fragments by means of genetic 
5 engineering technology. The present invention also encompasses single-chain 

antibodies that are immunologically reactive with an epitope of a bacterial 
hemoglobin receptor protein, made by methods known to those of skill in the art. 

The antibodies and fragments used herein can be labeled preferably with 
radioactive labels, by a variety of techniques. For example, the biologically active 
10 molecules can also be labeled with a radionucleotide via conjugation with the cyclic 
anhydride of diethylenetriamine penta-acetic acid (DPTA) or bromoacetyl 
aminobenzyl ethylamine diamine tetra-acidic acid (BABE). See Hnatowich et al. 
(1983, Science 220: 613-615) and Meares et al. (1984, Anal. Biochem. 142: 68-78, 
both references incorporated by reference) for further description of labeling 
15 techniques. 

The present invention also encompasses an epitope of a bacterial hemoglobin 
receptor protein of the invention, comprised of sequences and/or a conformation of 
sequences present in the receptor molecule. This epitope may be naturally occurring, 
or may be the result of proteolytic cleavage of a receptor molecule and isolation of 
an epitope-containing peptide or may be obtained by synthesis of an epitope- 
containing peptide using methods well known to those skilled in the art. The present 
invention also encompasses epitope peptides produced as a result of genetic 
engineering technology and synthesized by genetically engineered prokaryotic or 
eukaryotic cells. 

25 The invention also includes chimeric antibodies, comprised of light chain and 

heavy chain peptides immunologically reactive to a bacterial hemoglobin receptor 
protein-derived epitope. The chimeric antibodies embodied in the present invention 
include those that are derived from naturally occurring antibodies as well as chimeric 
antibodies made by means of genetic engineering technology well known to those of 

30 skill in the art. 

Also provided by the present invention are diagnostic and therapeutic methods 
of detecting and treating an infection in a human, by a pathogenic organisms 



20 
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expressing a bacterial hemoglobin receptor protein. Diagnostic reagents for use in 
such methods include the antibodies, most preferably monoclonal antibodies, of the 
invention. Such antibodies are used in conventional immunological techniques, 
including but not limited to enzyme-linked immunosorbent assay (ELISA), 
radioimmune assay (RIA), Western blot assay, immunological titration assays, 
immunological diffusion assays (such as the Ouchterlony assay), and others known 
to those of skill in the art. Also provided are epitopes derived from a bacterial 
hemoglobin receptor protein of the invention and immunologically cross-reactive to 
said antibodies, for use in any of the immunological techniques described herein. 

Additional diagnostic assays include nucleic acid hybridization assays, using 
the nucleic acids of the invention or specifically-hybridizing fragments thereof, for 
sensitive detection of bacterial genomic DNA and/or mRNA. Such assays include 
various blot assays, such as Southern blots, Northern blots, dot blots, slot blots and 
the like, as well as in vitro amplification assays, such as the polymerase chain 
reaction assay (PCR), reverse transcriptase-polymerase chain reaction assay (RT- 
PCR), ligase chain reaction assay (LCR), and others known to those skilled in the 
art. Specific restriction endonuclease digestion of diagnostic fragments detected 
using any of the methods of the invention, analogous to restriction fragment linked 
polymorphism assays (RFLP) are also within the scope of this invention. 

The invention also provides therapeutic methods and reagents for use in 
treating infections in a human, cause by a microorganism expressing a bacterial 
hemoglobin receptor protein of the invention, most preferably a bacteria of Neisseria 
species. Therapeutic reagents for use in such methods include the antibodies, most 
preferably monoclonal antibodies, of the invention, either per se or conjugated to 
bactericidal or bacteriostatic drugs or other antibiotic compounds effective against the 
infectious microorganism. In such embodiments, the antibodies of the invention 
comprise pharmaceutical compositions, additionally comprising appropriate 
phannaceutically-acceptable carriers and adjuvants or other ancillary components 
where necessary. Suitable carriers are, for example, water, saline, dextrose, 
glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the 
pharmaceutical formulation may contain minor amounts of auxiliary substances such 
as wetting or emulsifying agents, pH buffering agents, or other compounds which 
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enhance the effectiveness of the antibody. In these embodiments, it will be 
understood that the therapeutic agents of the invention serve to target the infectious 
bacteria, either by immunologically "tagging" the bacteria with an antibody of the 
invention for recognition by cytotoxic cells of a human's immune system, or by 
5 specifically delivering an antimicrobial drug to the infectious microorganism via the 

bacterial hemoglobin receptor protein. 

Additional therapeutic reagents include the nucleic acids of the invention or 
fragments thereof, specifically antisense embodiments of such nucleic acids. Such 
antisense nucleic acids may be used themselves or embodied in a recombinant 

10 expression construct specific for antisense expression, wherein said construct is 
genetically engineered to co-opt a portion of the genome of a bacterial virus, 
preferably a bacteriophage, infectious for the bacterial pathogen responsible for the 
infection. In these embodiments, introduction of the antisense nucleic acids of the 
invention into the bacterial cell inhibits, attentuates or abolishes expression of the 

15 bacterial hemoglobin receptor, thereby reducing the virulence of the bacterial 

infection and enabling more effective antibacterial interventions. In additional 
embodiments, bacteriophage are provided bearing "knockout" copies of a bacterial 
hemoglobin receptor gene, whereby the phage achieves genetic mutation of the 
endogenous hemoglobin receptor gene in the infectious bacteria via, for example, 

20 homologous recombination of the exogenous knockout copy of the bacterial 

hemoglobin receptor gene with the endogenous hemoglobin receptor gene in the 
infectious microorganism. 

The Examples which follow are illustrative of specific embodiments of the 
invention, and various uses thereof. They set forth for explanatory purposes only, 
25 and are not to be taken as limiting the invention. 

EXAMPLE 1 
Plasmids. bacteria, and media 
Plasmids and bacteria used herein are listed on Table 1. E. coli strains were 
30 routinely grown in Luria-Bertani (LB) broth supplemented with 5-aminolevulinic acid 
and 50mg/L hemin chloride as necessary. N. meningitidis 8013 is a serogroup C 
clinical isolate (Nassif et al. , 1993, MoL Microbiol. 8: 719-725). The meningococci 
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were routinely grown on GCB agar (Difco) supplemented as described by Kellogg 
et al. (1963, /. Bacteriol 85: 1274-1279), and incubated at 37°C under a 5% C0 2 
atmosphere. Transformation of meningococci was performed as described by Nassif 
et al. (1992, MoL Microbiol. 6: 591-597). When necessary, the following antibiotics 
were used with E. coli: rifampicin, 100 mg/L; tetracycline, 15 mg/L; kanamycin, 
30 mg/L; chloramphenicol, 20 mg/L; carbenicillin, 100 mg/L. For Neisseriae, 
kanamycin at 100 mg/L was used when needed. 

EXAMPLE 2 

Auxotroph Complementation Cloning of a hemoglobin Receptor Gene from 

Neisser ia menintftiAU 

In order to identify N. meningitidis outer membrane receptor(s) involved in 
the uptake of haemin and/or haemoglobin iron, an auxotroph complementation 
cloning strategy was used, similar to the approach previously taken to identify the 
Y. enterocolitica and V. cholerae hemin receptors (see Stojiljkovic and Hantke, 1992, 
EMBO J. 11: 4359-4367; Henderson and Payne, 1994, /. Bacteriol. 176: 3269- 
3277). This strategy is based on the fact that the outer membrane of Gram-negative 
bacteria is impermeable to hemin (McConville and Charles, 1979, J. Microbiol. 113 : 
165-168) and therefore E. coli porphyrin biosynthesis mutants cannot grow on 
exogenously supplied hemin. If provided with the N. meningitidis outer membrane 
hemin receptor gene, the E. coli porphyrin mutant would be able to use exogenously 
supplied hemin as its porphyrin source. 

A cosmid bank of AT. meningitidis 8013 clone 6 DNA was prepared using 
conventional cosmid cloning methodologies (Sambrook et aL, 1989, ibid.). N. 
meningitidis bacterial DNA was partially digested by Mbol, size fractionated on 
sucrose gradients and cloned into the BamUl site of the cosmid vector pLAFR2 
(Riboli et aL, 1991, Microb. Pathogen. 10: 393-403). This cosmid bank was 
mobilized into the E. coli hemA aroB Rif r recipient strain by triparental matings 
using a conjugal plasmid pRK2013: :Tn9. The mating mixture was plated onselective 
plates containing hemin chloride (50mg/L), 0.1 mM 2,2 / -dypyridil and rifampicin 
(100 mg/L). Several clones growing on exogenously supplied haemin were isolated 
after an overnight incubation. 
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TABLE I 



10 



15 



20 



25 



30 



STRAIN 


GENOTYPE 


E. coli K12 




EB53 


hemA, aroB, rpoB 


KP1041 


MC4100tonB::Km r 


H1388 


exbB::TnlO Alac pro 


TSM348 


endA, hsdR, pro, supF, pRK2013::Tn9 


IR754 


EB53, tonB::Km r 


IR736 


EB53, exbB::TxdO 


DH5a 


recA, gyrB 


N. meningitidis 




AlCv^ 13077 


Serotype A 




Serotype B* 


MC8013 


clone 6, wild type 


MChmbR 


hmbR::aphA-3 


N. gonorrhoeae MS11A 




PLASMIDS 




pSUSK 


pA15 replicon, chloramphenicol' 


pHEM22 


pLAFR2, hemoglobin-utilizing cosmid 


pHEM44 


pLAFR2, hemin-utilizing cosmid 


pIRS508 


6kb Clal, pSUSK 


pIRS523 


3kb BamHl/Sall, pUC19 


pIRS525 


1.2kb aphA-3, in Notl site of pIRS523 


pIRS527 


4kb BammiClal, pBluescript 


pIRS528 


0.7kb NotVBamHl, pBluescript 


pIRS692 


3.3kb BammiHiniSm, SU(SK) 



Laboratory collection 
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The hemin utilization phenotype of these transformants was tested by re- 
introduction of the cosmids into naive E. coli hemA aroB cells and by monitoring the 
growth on hemin-supplemented plates. The ability of E. coli strains to utilize heme 
or hemoglobin as the sole iron source was tested as previously described (Stojiljkovic 
5 and Hantke, 1992, ibid.). Cells were grown on LB agar supplemented with 50 M M 

deferoxamine mesylate (an iron chelating agent, obtained from Sigma Chemical Co. , 
St. Louis, MO). Filter discs (1/4 inches, Schleichner & Schuell, Inc., Keene, NH.) 
impregnated with the test compounds (20 M L of 5 mg/ml stock solutions unless 
otherwise stated) were placed on these plates. After overnight growth at 37°C with 
5 % C0 2 , zones of growth around the discs were monitored. The iron-bound proteins 
tested in this assay (all obtained from Sigma Chemicals Co.) were hemoglobin from 
human, baboon, bovine and mouse sources, bovine hemin, human lactoferrin (90% 
iron saturated), and human transferrin (90% iron saturated, obtained from Boehringer 
Mannheim Biochemicals, Indianapolis, IN). A total of six hemin utilization positive 
15 cosmids were obtained using this protocol. Results using such assays are shown in 

Table II. 



EXAMPLE 3 

_ _ Restriction Enzyme Digestion Mapping of Hemin Utilization 

20 Positive Cnsmiris 

Cosmid DNA from six hemin-utilization positive cosmids obtained as 

described in Example 2 were digested with Ctol, and the resulting fragments were 

cloned into CM-digested pSU(SK) vector (obtained from Stratagene, LaJolla, CA). 

One subclone, containing a 6 kb Cla\ fragment from cosmid cos22 (the resultant 

25 plasmid was designated pIRS508), was determined to allow utilization of hemin and 

hemoglobin by E. coli hemA aroB assayed as described in Example 2. Another such 
clone, containing an 11 kb Clal fragment from cos44 was also determined to allow 
hemin utilization in these auxotrophic mutant cells. Restriction analysis and Southern 
hybridization indicated that the DNA fragments originating from cos22 and cos44 are 

30 unrelated. 

The deduced restriction enzyme digestion map of cosmid clone pIRS508 is 
shown in Figure 1 Plasmid pIRS508 enabled E. coli hemA aroB to use both hemin 
and bovine hemoglobin as iron sources although growth on hemoglobin was 
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somewhat weaker than on hemin (Table II). Further subcloning localized the 
hemin/hemoglobin utilization locus to the Bamlil/Hindlll fragment of the insert. In 
addition to sequences encoding the hemoglobin receptor gene (designated hmbR), 
sequences for a Neisseria insertion element (IS1106) and a portion of a Neisseria 
5 small repetitive element (IR7) are also represented in the Figure. 



EXAMPLE 4 

Nucleotide Sequence Analysis of a Cosmid Clone Encoding 
a Neisseria Hemoglobin Receptor Gene 

The nucleotide sequence of the 3.3 kb BamHhHindlll DNA fragment 

carrying the hmbR gene and its promoter region was determined using the dideoxy 

chain termination method using a Sequenase 2.0 kit (obtained from U.S. 

Biochemicals, Cleveland, OH) and analyzed using a BioRad electrophoresis system, 

an AutoRead kit (obtained from Pharmacia, Uppsala, SE) and an ALF-370 automatic 

sequenator (Pharmacia, Uppsala, Sweden). Plasmid subclones for sequencing were 

produced by a nested deletion approach using Erase-a-Base kit (obtained from 

Promega Biotech, Madison, WI) using different restriction sites in the hmbR gene. 

The nucleotide and predicted amino acid sequences of the hmbR gene are shown in 

Figure 2 

An open reading frame (ORF) encoding the N. meningitidis, serotype C 
hemoglobin receptor protein begins at position 470 of the sequence and encodes a 
protein having an amino acid sequence of 792 amino acids, with a calculated 
molecular weight of 85.5 kDa. A Shine-Delgarno sequence (SD) is found at position 
460. The HmbR receptor protein contains a signal peptidase I recognition sequence 
at residues 22 to 24 of the protein (underlined) , consistent with the fact that it is an 
outer membrane protein. 

A typical Fur binding nucleotide sequence (designated "Fur box") was found 
in the promoter region of the hmbR gene (Figure 2). Like hemin utilization in 
Yersiniae and Vibrio, hemin and hemoglobin utilizati n in Neisseria are known to be 
iron-inducible phenotypes (West and Sparling, 1985, Infect. Immun. 47: 388-394; 
Dyer et aL 9 1987, Infect. Immun. <£: 2171-2175). In Gram-negative bacteria, 
conditional expression of many iron utilization genes is regulated by the Fur 
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repressor, which recognizes a 19 bp imperfect dyad repeat (Fur- box) in the promoter 
regions of Fur-repressed genes. Recently, a genetic screen (FURTA) for the 
identification of Fur-regulated genes from different Gram-negative bacteria was 
described (Stojiljkovic etaL, 1994, /. MoL Biol. 236: 531-545), and this assay was 
5 used to test whether hmbR expression was controlled in this way. Briefly, a plasmid 

carrying a Fur-box sequence is transformed into an E. coli strain (H1717) which 
possesses a Fur-regulated lac fusion in the chromosome. Expression of this Fur- 
regulated lac fusion is normally repressed. Introduction of a multicopy Fur-box 
sequence on the plasmid titrates the available Fur repressor thus allowing expression 

10 of the Fur-regulated lac fusion (this phenotype is termed FURTA positive). Using 

this screen, the smallest insert fragment from cosmid pIRS508 that produced a 
FURTA positive result was a 0.7 kb BamHl-Notl DNA fragment carried on plasmid 
pIRS528 (see Figure 1). This result indicated that the 0.7 kb BamHl-Notl fragment 
carries a Fur-box and that gene expression from the hmbR promoter is controlled by 

15 a fur-type operon. 

N. meningitidis, serotype C hemoglobin receptor protein was expressed in 
vitro using an £. coli S30 extract system from Promega Biotech (Madison, WI). The 
3.3 kb BamHl-Hindni fragment, expressed in vitro, encoded a 90kDa protein which 
corresponds in size to the predicted molecular weight of the unprocessed HmbR 

20 receptor. SDS/ 10% PAGE analysis showing the observed M r of 90K is shown in 

Figure 3. 

Immediately downstream of the hmbR gene (at positions 2955 to 3000 bp in 
Figure 2) was found a short nucleotide sequence that is 99% identical to the flanking 
sequence of the PHI gene of N. gonorrhoeae (Gotschlich et aL , 1987, J. Exp. Med. 

25 1£2: 471-482). The first 26 bp of this sequence represents one half of the inverted 

repeat (IR1) of the N. gonorrhoeae small repetitive element. This element is found 
in approximately 20 copies in both N. gonorrhoeae and N. meningitidis (Correia et 
aL, 1988, J. Biol. Chem. 263: 12194-12198). The analysis of the nucleotide 
sequence from position 3027 to the Clal (3984) restriction site (only the nucleotide 

30 sequence from BamiU (1) to HinSSSL (3370) is shown in Figure 2) indicated the 

presence of an IS1 106 element (Knight et aL , 1992, MoL Microbiol. 6: 1565-1573). 
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Interestingly, no nucleotide sequence similar to the IS 1106 inverted repeat was found 
between the IR1 element and the beginning of the homology to IS 1106. 

These results were consistent with the cloning and identification of a novel 
hemoglobin receptor protein gene from N. meningitidis, embodied in a 3.3kb 
BamHl/HindUI fragment of N. meningitidis genomic DNA. 

EXAMPLE 5 

Amino Acid Sequence Comparison of the N. meningitidis 
Hemoglobin Receptor Protein and Neisseria 
Lactoferrin and Transferrin Receptor Proteins 

A comparison of the transferrin (Tbpl; Legrain et aL , 1993, Gene 130: 81- 

90), lactoferrin (LbpA; Pettersson et al, 1993, Infect. Immun. 61: 4724-4733, and 

1994, /. BacterioL 176: 1764-1766) and hemoglobin receptors (HmbR) from N. 

meningitidis is shown in Figure 4. The comparison was done with the CLASTAL 

program from the PC/GENE program package (Intelligenetics, Palo Alto, CA). 

Only the amino-terminal and carboxyl terminal segments of the proteins are shown. 

An asterisk indicates identity and a point indicates similarity at the amino acid level. 

Lactoferrin and transferrin receptors were found to share 44.4% identity in amino 

acid sequence. In contrast, homology between these proteins and the hemoglobin 

receptor disclosed herein was found to be significantly weaker (22% amino acid 

sequence identity with lactoferrin and 21% with transferrin receptor). 

EXAMPLE 6 

TonB/ExbBD-Dependence of Hemin Transport by the N. meningitidis 

Hemoglobin Receptor 

It was known that the transport of iron-containing siderophores, some colicins 

and vitamin B12 across the outer membrane of E. coli depends on three cytoplasmic 

membrane proteins: TonB, ExbB and ExbD (Postle 1990, MoL Microbiol. 133: 891- 

898; Braun and Hantke, 1991, in Winkelmann, (ed.), Handbook of Microbial Iron 

Chelates, CRC Press, Boca Raton, Fla., pp. 107-138). In Yersinia and Hemophilus, 

hemin uptake was shown to be a TonB-dependent process (Stojiljkovic and Hantke, 

1992, ibid.; Jarosik et aL, 1994, Infect. Immun. £2: 2470-2477). Through direct 

interaction between the outer membrane receptors and the TonB cytoplasmic 
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machinery, the substrate bound to the receptor is internalized into the periplasm 
(Heller et al, 1988, Gene 64: 147-153; Schoffler and Braun, 1989, Molec. Gen. 
Genet. 217: 378-383). This direct interaction has been associated with a particular 
amino acid sequence in membrane proteins associated with the TonB machinery. 

All TonB-dependent receptors in Gram-negative bacteria contain several 
regions of high homology in their primary structures (Lundrigan and Kadner, 1986, 
7. Biol Chem. 261: 10797-10801). In the amino acid sequence comparison 
described in Example 5, putative TonB-boxes of all three proteins are underlined. 
The carboxyl terminal end of the HmbR receptor contains the highly conserved 
terminal phenylalanine and position 782 arginine residues thought to be part of an 
outer membrane localization signal (Struyve et al , 1991 , /. Mol Biol 218: 141-148; 
Koebnik, 1993, Trends Microbiol 1: 201). At residue 6 of the mature HmbR 
protein, an amino acid sequence - ETTPVKA - is similar in sequence to the so called 
TonB-boxes of several Gram-negative receptors (Heller et al, 1988, ibid.). 
Interestingly, the putative TonB-box of HmbR has more homology to the TonB-box 
of the N. gonorrhoeae transferrin receptor (Cornelissen et al , 1992, 7. Bacteriol 
174: 5788-5797) than to the TonB-boxes of E. coli siderophore receptors. When the 
sequence of the HmbR receptor was compared with other TonB-dependent receptors, 
the highest similarity was found with Y. enterocolitica HemR receptor although the 
similarity was not as high as to the Neisseria receptors. 

In order to prove the TonB-dependent nature of the N. meningitidis, serotype 
C hemoglobin receptor, hmbR was introduced into exbB and tonB mutants of E. coli 
EB53, and the ability of the strains to utilize hemin and hemoglobin as porphyrin and 
iron sources was assessed. In these assays, both mutants of E. coli EB53 were 
unable to use hemin either as a porphyrin source or as an iron source in the presence 
of a functional hmbR (Table 2). The usage of hemoglobin as an iron source was also 
affected (Table 2). These results are consistent with the notion that the hmbR gene 
product, the N. meningitidis hemoglobin receptor protein of the invention, is TonB- 
dependent, since expression of this gene in TonB wild type E. coli supported the use 
of hemin and hemoglobin as sole iron source in the experiments disclosed in 
Example 2. 
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EXAMPLE 7 

Functional Demonstration that the hmbR Gene Product is the 
Hemoglobin Receptor Protein in meningitidis 

As shown in the data presented in Table II, hmbR mediated both hemin and 

hemoglobin utilization when expressed in E. coli, but hemoglobin utilization was less 

vigorous than hemin utilization. To determine if the HmbR receptor has the same 

specificity in N. meningitidis, hmbR was inactivated with a 1 .2kb kanamycin cassette 

(aphA-3; Nassif et al., 1991, ibid.) and transformed into wild-type N. meningitidis 

8013 clone 6 (serotype C) cells. The inactivation of the chromosomal hmbR copy 

of the Km-resistant transformants was confirmed by Southern hybridization, as 

shown in Figure 5. As can be seen from Figure 5, wild-type N. meningitidis 

genomic DNA contains only one copy of the hmbR gene (lanes 1 and 3). In the Km r 

transformants, the size of the DNA fragments containing the wild-type gene has 

increased by 1 .2 kb, which is the size of the Kan cassette (Figure 5, lanes 2 and 4). 

When tested for its ability to utilize different iron-containing compounds, these 

mutant cells were found to be unable to use hemoglobin-bound iron, regardless of 

the source (human, bovine, baboon, mouse). The ability of the mutant to utilize 

hemoglobin-haptoglobin was not tested because the wild-type N. meningitidis strain 

is unable to use haptoglobin-haemoglobin complex as an iron source. However, the 

mutant was still able to use hemin iron, lactoferrin- and transferrin-bound iron as 

well as citrate-iron (Table II). As the iron-containing component of hemoglobin is 

hemin, a hemoglobin receptor would be expected to be capable of transporting hemin 

into the periplasm. Indeed, the cloning strategy disclosed herein depended on the 

ability of the cloned meningococcal receptor to transport hemin into the periplasm 

of E. coli. These results strongly suggest that N. meningitidis has at least two 

functional receptors that are involved in the internalization of hemm-containing 

compounds. One is the hemoglobin receptor described herein, which allows the 

utilization of both hemin and hemoglobin as iron sources. The other putative 

receptor in N. meningitidis is a hemin receptor which allows utilization of only 

hemin. This schema is also consistent with the isolation of several cosmid clones 

that allow E. coli EB53 to utilize hemin. DNAs from these cosmids do not hybridize 

with our hmbR probe , indicating that these clones encode a structurally-distinct 
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receptor protein capable of transporting hemin into the periplasm of N, meningitidis 
cells. 

EXAMPLE 8 

Attenuation nf Virulence i n hmbR Mutant N. meningitidis Cells In Vivn 
In order to test the importance of hemoglobin and hemin scavenging systems 
of N. meningitidis in vivo, the hmbR -mutant and the wild type strain of N. 
meningitidis, serotype C were inoculated into 5 day old infant rats and the numbers 
of bacteria recovered from blood and cerebrospinal fluid were followed. In these 
experiments, the method for the assessing N. meningitidis, serotype C virulence 
potential was essentially the same as described by Nassif et al. (1992, ibid.) using 
infant inbred Lewis rats (Charles River, Saint Aubin les Elbeufs, France). Inbred 
rats were used to minimize individual variations. Briefly, the 8013 strain was 
reactivated by 3 animal passages. After the third passage, bacteria were kept frozen 
in aliquots at -80° C. To avoid the possibility that modifications in the course of 
infection could result from selection of one spontaneous avirulent variant, one aliquot 
from the animal-passed frozen stock of 8013 was transformed with chromosomal 
DNA from the hmbR mutant, the resultant Kan r transformants were pooled without 
further purification and kept frozen at -80°C. For each experiment, all infant rats 
were from the same litter. N. meningitidis 8013 was grown overnight and 2 X 10 6 
bacteria injected intraperitoneally into the infant rat. Three rats were used for each 
meningococcal strain. The course of infection was followed over a 24 hours time 
period with blood collected at the indicated times. At the 24 h time period, the rats 
were sacrificed, the cerebrospinal fluid (CSF) collected and the number of colony- 
forming units (CFU) determined. Each experiment was performed in replicate; 
similar results were obtained both times. 

The results of these experiments are shown in Figure 6. The hmbR ' strain, 
which is unable to use hemoglobin as an iron source, was recovered from the blood 
of infected animals in significantly lower numbers when compared with the wild type 
strain. Both the mutant and the wild type strain were, still able to cross the blood- 
brain barrier as indicated by the isolation of bacteria from the cerebrospinal fluid. 
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These results indicate that hemoglobin represents an important iron source for N. 
meningitidis during growth in vivo. 

EXAMPLE 9 

Polymerase Chain Reaction Amplification of Hemoglobin Receptor 
Genes from N. menin gitidis Serotypes and N. gonorrhoeae 

From the nucleotide sequence of the 3.3 kb BamHl-Hindm DNA fragment 
carrying the hmbR gene and its promoter region was determined specific 
oligonucleotide promers for in vitro amplification of the homologous hemoglobin 
receptor protein genes from N. meningitidis serotypes A and B and N. gonorrhoeae 
MSI 1 A as follows. 

The following oligonucleotide primers were developed for in vitro 
amplificaiton reactions using the polymerase chain reaction (PCR; Saiki et al. , 1988, 
Science 230: 1350-1354): 

5 -AAACAGGTCTCGGCATAG-3 ' (sense primer) (SEQ ID No. : 1 1) 

5 ' -CGCGAATTC AAACAGGTCTCGGCATAG-3 ' (SEQ ID No. : 12) 

(antisense primer) 

for amplifying the hemoglobin receptor protein from N. meningitidis, serotype A; 

5 ' -CGCG AATTC AAAAACTTCC ATTCC AGCG ATACG-3 ' (SEQ ID No : 13) 
(sense primer) 

5 '-TAAAACTTCCATTCCAGCGATACG-3 ' (antisense primer) (SEQ ID No.: 14) 

for amplifying the hemoglobin receptor protein from N. meningitidis, serotype B; 

5 '-AAACAGGTCTCGGCATAG-3' (sense primer) (SEQ ID No. : 15) 

or 

5 ' -CGCG AATTC AAACAGGTCTCGGCATAG-3 ' (SEQ ID No. : 16) 

(sense primer) 

and 

5 '-CGCGAATTC AAAAACTTCC ATTCC AGCGATACG-3 ' (SEQ ID No.:17) 
(antisense primer) 

or 

5 '-TAAAACTTCCATTCCAGCGATACG-3 ' (antisense primer) (SEQ ID No.: 18) 
for amplifying the hemoglobin receptor protein from N. gonorrhoeae MS11A. 

Genomic DNA from N. meningitidis serotype A or B or N. gonorrhoeae 
species was prepared using standard techniques (see Sambrook, et al., ibid.), 
including enzymatic degradation of bacterial cell walls, protoplast lysis, protease and 
RNase digestion, extraction with organic solvents such as phenol and/or chloroform, 
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and ethanol precipitation. Crude DNA preparations were also used. An amount 
(typically, about 0. 1/xg) of genomic DNA was used for each amplification reaction. 
A PCR amplification reaction consisted of Pfu polymerase (Stratagene, LaJolla, CA) 
and/or Tag polymerase (Boehringer Mannheim, Germany) in the appropriate buffer 
including about 20picomoles of each amplificaiton primer and 200nanomoles of each 
deoxynucleoside triphosphate. Amplification reactions were performed according to 
the following scheme: 

First cycle 5 min at 95 °C 

2minat51°C 
6 min at 72°C 

Cycles 2-13 45 sec at 95 °C 

35 sec at 49°C 
10 min at 72°C 



Cycles 14-30 25 sec at 95 °C 

35 sec at 47 °C 
10 min at 72 °C 

Upon completion of the amplification reaction, DNA fragments were cloned either 
blunt-ended or, after EcoW digestion, into EcoEl digested pSUKS or pWKS30 
vectors and transformed into bacteria. Positively-selected clones were then analyzed 
for the presence of recombinant inserts, which were sequenced as described above 
in Example 4. 

As a result of these experiments, three clones encoding the hemoglobin 
receptor genes from N. meningitidis serotypes A and B and N. gonorrhoeae MS11A 
were cloned and the sequence of these genes determined. The nucleic acid sequence 
for each of these genes are shown in Figures 7 (N. meningitidis, serotype A), 8 (N. 
meningitidis, serotype A) and 9 (AT. gonorrhoeae MS 11 A). 

The degree of homology between the cloned hemoglobin receptors from the 
different N. meningitidis serotypes and N. gonorrhoeae MS11A was assessed by 
nucleic acid and amino acid sequence comparison, as described in Example 5 above. 
The results of these comparisons are shown in Figures 10 and 11, respectively. 
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Hemoglobin receptor genes from the three N. meningitidis serotypes and N. 
gonorrhoeae MS11A were found to be from 86.5% to 93.4% homologous; the most 
homologous nucleic acids were N. meningitidis serotypes B and C, and the most 
divergent nucleic acids were N. meningitidis serotype B and N. gonorrhoeae MS 1 1 A 
5 (Figure 10 and Table III). Homoglobin receptor proteins from all four Neisseria 

species showed a high degree of homology to the other members of the group, 
ranging from 87% homology between the hemoglobin receptor proteins from N. 
gonorrhoeae MS11A and N. meningitidis serotype B to 93% homology between 
hemoglobin receptor proteins from N. meningitidis serotypes A and B (Figure 11). 
10 In this comparison, all four receptors were found to share 84.7% amino acid 

sequence identity, and up to 11.6% sequence similarity (i.e., chemically-related 
amino acid residues at homologous sites within the amino acid sequence). The non- 
conserved amino acids were found clustered in the regions of the amino acid 
sequence corresponding to the external loops in the predicted topographical structure 
15 of the hemoglobin receptor proteins. 

It should be understood that the foregoing disclosure emphasizes certain 
specific embodiments of the invention and that all modifications or alternatives 
equivalent thereto are within the spirit and scope of the invention as set forth in the 
appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

f«! c^^° r ! g0n Health Sciences University 

(D) STATE: Oregon 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 97201-3098 

(G) TELEPHONE: 503-494-8200 

(H) TELEFAX: { 503 ) -494 -4729 

(ii) TITLE OF INVENTION: A Novel Rarh^ i u , * 

and Uses 1 Bacte ^ial Hemoglobin Receptor 

(iii) NUMBER OF SEQUENCES: 18 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

SOFTWARE: Patentln Release #i.o, Version #1.25 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95 / 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 23 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..2373 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

s & ss si si? s ss s s s s? s« s» s 
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TTC GGC AAT CCG GTC TTG GCA GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG ATA AAA GCA GTG CGC GTT AAA GGT CAG CGC AAT 144 
Pro Val Lys Ala Glu He Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC. CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAT AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 240 
Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 288 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGT GTA AAC CTG CCT GAT 336 
Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 3 84 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT TTG TCT ATC GAC CCC GAA CTC GTA CGC AAT ATT GAA ATC GTG AAG 4 32 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Val Lys 
130 135 140 

GGC GCA GAC TCT TTC AAT ACC GGC AGT GGT GCA TTG GGC GGC GGT GTG 48 0 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
"5 150 155 160 

AAT TAC CAA ACG CTG CAA GGC CGT GAT TTG CTG TTG GAC GAC AGG CAA 528 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGT GTG AGT AAC GAC CGC GTG GAT GCT GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACC GAA AGC GCG GGC AAC 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Asn 
210 215 220 

CGC GGC TAT CCG GTA GAA GGT GCG GGT AAA GAA ACG AAT ATC CGC GGT 72 0 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Lys Glu Thr Asn He Arg Gly 
225 230 235 240 

TCC GCC CGC GGC ATC CCC GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 768 
Ser Ala Arg Gly He Pro Aap Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAT 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 
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AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AGA CGG 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Ara 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ATG CCT GAT TCA AAT TGG TTG 
Arg Asn Ala Asn Leu Phe Tyr Glu Trp Met Pro Asp Ser Asn Trp Leu 
305 310 315 3 2 o 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG AAA ACC AAA GTG GCG GCG 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Lys Thr Lys Val Ala Ala 
325 330 335 

ATT AAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACT GAG 
He Asn Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr Glu 
340 345 350 

TAC CAT AAA AAG GAA GTT GGC GAA ATA TAC AAC CGC AGC ATG GAC ACC 
Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp Thr 
35 5 360 365 

CGA TTC AAA CGT TTT ACT TTG CGT TTG GAC AGC CAT CCG TTG CAA CTC 
Arg Phe Lys Arg Phe Thr Leu Arg Leu Asp Ser His Pro Leu Gin Leu 
370 375 380 

GGG GGG GGG CGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC CGC CGT 
Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser Ara Ara 
385 390 395 y 400 

GAT TTT GAA AAC CTA AAC CGC GAC GAT TAT TAC TTC AGC GGC CGT GTT 
Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Ara Val 
405 410 4il 

GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC AAC TAC 
Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAC CAA ATT CAA TGG AAC GAC GTG TTC AGT AGC 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGC GCA GGT ATC CGT TAC GAC CAC ACC AAA ATG ACG CCT CAG GAA TTG 
Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
45 0 455 460 

AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCA CCT GCA GCC AAC ACT 144 0 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG AAT CAG 
Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 1536 
Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT TGG CTG 1584 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 1632 
Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGC ATG CTG GAT GCC AAC CTG TAT CAA 1680 
Leu Gin Gly Arg Ser Glu Lys Gly Met Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 
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AGC AAT TAC CGC AAT TTC CTG TCT GAA GAG CAG AAG CTG ACC ACC AGC 172 8 

Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

GGC ACT CCC GGC TGT ACT GAG GAA AAT GCT TAC TAC AGT ATA TGC AGC 1776 
Gly Thr Pro Gly Cys Thr Glu Glu Asn Ala Tyr Tyr Ser He Cys Ser 
580 585 590 

GAC CCC TAC AAA GAA AAA CTG GAT TGG CAG ATG AAA AAT ATC GAC AAG 1824 
Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn He Asp Lys 
595 600 605 

GCC AGA ATC CGC GGT ATC GAG CTG ACA GGC CGT CTG AAT GTG GAC AAA 1872 
Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp Lys 
610 615 620 

GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA CTG TTC GGC TCG CTG GGT 192 0 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Glv 
625 630 635 640 

TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA CAG 1968 
Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

CCG CTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC GAA AAA 2 016 

Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

TGG GGC GTA TTC TCC CGC CTG ACC TAT CTG GGC GCG AAA AAG GTC AAA 2064 
Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val Lys 
675 680 685 

GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG CCT TTG 2112 
Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro Leu 
690 695 700 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 2160 
Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACC CTG CGT 22 0 8 

Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr Leu Arg 
725 730 735 

GCG GGC GTG TAC AAC CTG TTC AAC CGC AAA TAC ACC ACT TGG GAT TCC 2256 
Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

CTG CGC GGT TTA TAT AGC TAC AGC ACC ACC AAT GCG GTC GAC CGC GAT 23 04 

Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 

GGC AAA GGC TTA GAT CGC TAC CGC GCC CCA GGC CGC AAT TAC GCC GTA 23 52 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala Val 
770 775 780 

TCG CTG GAA TGG AAG TTT TAA 2373 
Ser Leu Glu Trp Lys Phe * 
785 790 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu val Gly Ser He 
1 5 10 15 

Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu He Lys Ala Val Arg Val Lys Gly Gin Arg Asn 

40 45 

Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 



60 



Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Glv 
65 70 75 si 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 * 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Val Lys 



140 



Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 

150 155 iso 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 19 o * 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Asn 



220 



Arg Gly Tyr Pro Val Glu Gly Ala Gly Lys Glu Thr Asn He Arg Gly 

230 235 240 

Ser Ala Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
24S 250 255 

Leu Gly Lys lie Ala Tyr Gin He Asn Asp Asn His Arg lie Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Ala Asn Leu Phe Tyr Glu Trp Met Pro Asp Ser Asn Trp Leu 
305 31° 315 " 320 
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Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Lys Thr Lys Val Ala Ala 
325 330 335 

He Asn Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr Glu 
340 345 350 

Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp Thr 
355 360 365 

Arg Phe Lys Arg Phe Thr Leu Arg Leu Asp Ser His Pro Leu Gin Leu 
370 375 380 

Gly Gly Gly Arg His Arg Leu Ser Phe. Lys Thr Phe Ala Ser Aro Ara 
385 390 395 M 400 

Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg Val 
405 410 415 

Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

Leu Gin Gly Arg Ser Glu Lys Gly Met Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

Gly Thr Pro Gly Cys Thr Glu Glu Asn Ala Tyr Tyr Ser He Cys Ser 
580 585 590 

Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn He Asp Lys 
595 600 60S 

Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp Lvs 
610 615 620 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Glv 
625 630 635 640 

Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 
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Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val Lys 
675 680 685 

Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro Leu 

695 700 

Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
* 710 715 720 

Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr Leu Aro 
725 730 735 

Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 3 * 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala Val 
/ / U 775 



780 



Ser Leu Glu Trp Lys Phe 
785 790 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2375 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2375 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



mI? CCA TTA CAA ATG CCC CCT ATC GCC GCG CTG CTC GGC AGT ATT 

Met Lys Pro Leu Gin Met Pro Pro He Ala Ala Leu Leu Gly Ser lie 
1 5 10 



15 



TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr 52 
20 25 30 



CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGT CAG r G r aut 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val JJJ §?y gj £g US 
35 40 45 



48 



96 



144 



GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

£2? t7- f GC *** GAC m GTG CGC ™T TCC ACC GAT GTC GGC 240 

Me| lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 

65 7 ° 75 80 

TTG AGC GAC AGG AGC CGT CAT CAA AAA GGC TTT GCC ATT CGC GGC GTG 288 
Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala He Axg Gly Val 
85 go g 5 
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336 



384 



432 



480 



576 



672 



GAA GGC GAC CGT GTC GGC GTT AGT ATT GAC GGC GTA AAC CTG CCT GAT ' 
Glu Gly Asp Arg Val Gly Val Ser lie Asp Gly Val Asn Leu Pro Asp 
100 105 no 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 
Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Asp lie Val Lvs 
130 135 .140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 
Gly . Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Glv Val 

145 "0 155 y y y ^ 

AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 528 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
"0 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn lie Arg Glv 
225 230 235 240 

If? ff G » GC ^ GT ?T CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 768 
Ser Ala Arg Gly lie Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys lie Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
290 ^ 295 G1U Ala Asp Asp Val Asn Arg Arg 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 960 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACC 1056 
Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 3S £ 

GAG TAC CAT AAA AAG GAA GTT GGC GAA ATC TAT AAC CGC AGC ATG GAT 1104 
Glu Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp 
355 360 365 
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T*2 Phf ?** » GT ACG CTG CGT ATG GAC CAT CCG TTG CAA 

Thr Thr Phe Lys Arg He Thr Leu Arg Met Asp Ser His Pro Leu Gl£ 

375 380 

CTC GGG GGG GGG CGA CAC CGC CTG TCG TTC AAA ACC TTT GCC rrn n»r< 
Leu oiy Gly Gly Arg His Arg Leu Ser Phe J£ J£ J£ £a Sy SJn 

390 395 ' 400 

CGT GAT TTT GAA AAC TTA AAC CGC GAC GAT TAC TAC TTC AGC GGC CGT 
Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Se Ser oty Sg 
405 410 415 a 

GTT GTT CGA ACC ACC AAC ACT ATC CAG CAT CCG GTG AAA ACC ACC AAC 
Val Val Arg Thr Thr Asn Ser lie Gin His Pro Val f£ ?Sr Thr 
420 425 430 



1152 



1200 



1248 



1296 



?Jr S3 Phf V* r CTG TCC GAC °* ATC <*»> TGG AAC GAC GTG TTC ACT 
Tyr Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Se Sel 

440 445 

AGC CGC GCA GGT ATC CGT TAC GAC CAC ACC AAA ATG ACG CCT CAG eaa 
Ser Arg Ala Gly He Arg Tyr Asp His Thr ™ 5e? ?S Pro Cln X£ 

455 460 

llf, GC ° GAC TGT CAT GCT TGT GAC *** ACA CCG CCT GCA GCC AAC 

Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro £2 

470 4 ? 5 480 

ACT TAT AAA GGC TGG AGC GGA TTT GTC GGC TTG GCG GCG CAG CTG AGP 
Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu 111 S 32 sir 
485 490 4 95 

CAA ACA TGG CGT TTG GGT TAC GAT GTG ACC TCA GGT TTC CGC GTG CCG 

Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arg Val Pro 
b0 ° 505 5xo 

AAT GCG TCT GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGC ACT TGG 
Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His gTJ J2 §?y £2 

520 525 

AAG CCT AAT CCT AAT TTG AAG GCA GAA CGC AGC Acr arr rnr ™^ 
Lys Pro Asn Pro Asn Leu Lys Ala £g S2r ?£ Ss ?£r l32 

5JU 535 



540 



565 570 



575 



AGC GGC ACA CCC GGC TGT ACT GAG GAG GAT GCT TAC TAC TAT AGA Trr 
Ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr j£J SS 
580 585 590 

AGC GAC CCC TAC AAA GAA AAA CTG GAT TGG CAG ATG AAA AAT ATC GAC 
Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Se? J£ JS !h Sp 

a " 600 6 o5 



w« n? C 2°* .7° CGC GGT ATC GAG TTG ACA GGC CGT CTG AAT GTG GAC 
Lys Ala Arg He Arg Gly lie Glu Leu Thr Gly Arg £n S2 2£ 



1344 



1392 



1440 



1488 



1536 



1584 



1680 



J GG ?J G GGG CGC GGC GAC AAA GGG ACA CTG GAT GCC AAC CTG TAT 

Ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Aii Le2 ™ 

550 555 sec 

Gl£ J2S P A ^ C I T C CT ° TCG GAA GAG CA G AAT CTG ACT GTC 1728 

Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 



1776 



1824 



1872 
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AAA GTA GCG TCT TTT GTT CCT GAG GGT TGG AAA CTG TTC GGC TCC rrr -. Q -,n 
Lys Val Ala Ser Phe Val Pro Glu. Gly Trp JJi 2J S£ Sr 22 

6 " 630 635 640 

GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA 196 8 

Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Th7 
645 650 S5S 

CAG CCG CTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC GAA 2016 
Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu 
660 665 670 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Pro Leu Gin Met Pro Pro He Ala Ala Leu Leu Gly Ser He 
1 5 10 



15 



Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 

35 40 45 



55 



2064 



2112 



2160 



AAA TGG GGC GTA TTC TCC CGC CTG ACC TAT CTA GGC GCG AAA AAG GTC 
Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Va? 
575 , 680 685 

AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACQ CCT 
Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 
690 695 700 

TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT 
Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 
° 5 710 715 720 

vJ? SET ? AT m T ? T* C °? C TAC *** CCG GCT AAA AAC CTG ACT TTG 2208 

Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 

725 730 735 

CGT GCA GGC GTG TAC AAC CTG TTC AAC CGC AAA TAC ACC ACT TGG rax „« 
Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg iy7 T?r JSr Thl ™ 2£ 

740 745 750 

TCC CTG CGC GGT TTA TAT AGC TAC AGC ACC ACC AAT GCG GTC GAC CGC 23 04 

Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg 
755 760 765 

GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA GGC CGC AAT TAC GCC 23 52 

Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr 22 

u 775 730 

GTA TCG CTG GAA TGG AAG TTT TAA 
Val Ser Leu Glu Trp Lys Phe * 
785 790 
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Ala Pro Ala Ala Val Glu Arg val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Glv 
65 70 75 * go 

Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala He Arg Glv Val 
85 90 9 | 

Glu Gly Asp Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
1°° 105 110 v 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 15 0 155 1S0 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 i7 5 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 iso 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 2 30 235 240 

Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tvr 
275 280 285 

Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 31° 315 320 

Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 3S0 

Glu Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp 
355 360 365 

Thr Thr Phe Lys Arg He Thr Leu Arg Met Asp Ser His Pro Leu Gin 
370 375 380 

Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Gly Gin 
385 390 395 400 
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Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg 

Val val Arg Thr Thr Asn Ser lie Gin His Pro Val Lys T hr Thr Asn 

425 430 
Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser 



440 44S 



Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu 



460 



Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn 

475 480 
Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Ser 

Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arg Val Pro 

Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Thr Trp 

Lys Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu 



535 540 



Ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Ala Asn Leu Ty 
Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 



560 

Kck - Glu Gin Asn Leu Thr 

565 S70 57S 

Ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr Arg Cys 

585 59q 

Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn He Asp 

600 605 

Lys Ala Arg He Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val Asp 



620 



Lys val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu 

635 640 
Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn ser Leu Leu Ser Thr 

5 65° 655 

Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu 

665 670 
Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val 



680 685 



Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 



700 



Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 



720 



Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 



730 73S 



Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp 
7 *° 745 750 c * 
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Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arc? 
■ 755 760 765 

Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala 
770 775 780 

Val Ser Leu Glu Trp Lys Phe 
785 790 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..2379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATG AAA CCA TTA CAA ATG CTC CCT ATC GCC GCG CTG GTC GGC AGT ATT 48 
Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu Val Gly Ser He 
15 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGC CAG CGC AAT 144 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 24 0 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 288 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGC GTA AAC CTG CCT GAT 336 
Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 no 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 3 84 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 432 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

- 58 - 



8NSDOCID: <WO 9612020A2_I_> 



WO 96/12020 V W 

PCT/US95/13623 



576 



AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 52 8 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn lie Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 768 
Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arq Arq 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 960 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arq Leu 
305 310 315 ^ 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ATA GAG GAT TCT TCC ACC TTG ACA 1056 
Val Asn Tyr Lys Gly Ser Phe Pro He Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

CGT AAC TAC AAT CAA AAG GAC TTG GAT GAA ATC TAC AAC CGC AGT ATG 1104 
Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu He Tyr Asn Arg Ser Met 
355 360 365 

GAT ACC CGC TTC AAA CGC ATT ACC CTG CGT TTG GAC AGC CAT CCG TTG 1152 
Asp Thr Arg Phe Lys Arg He Thr Leu Arg Leu Asp Ser His Pro Leu 
370 375 380 

CAA CTC GGG GGG GGG CGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC 1200 
Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
385 390 395 400 

CGC CGT GAT TTT GAA AAC CTA AAC CGC GAC GAT TAT TAC TTC AGC GGC 124 8 

Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly 
405 410 415 

CGT GTT GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC 12 96 

Arg Val Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr 
420 425 430 
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t AC I AC S? T TI C TCA CTG TCT GAC CAA ATT CA * TGG AAC GAC GTG TTC 
Asn Tyr Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Phe 
435 440 445 



1344 



AGT AGC CGC GCA GGT ATC CGT TAC GAT CAT ACC AAA ATG ACQ CCT CAG 
Ser Ser Arg Ala Gly lie Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
450 455 460 

GAA TTG AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC 
Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
465 470 475 480 

AAC ACT TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG 
Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
48 5 490 495 

AAT CAG GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC 
Asn Gin Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Aro Val 
500 505 510 

CCC AAT GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT 
Pro Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn 
515 520 525 

TGG CTG CCC AAT CCC AAC CTG AAA GCC GAG CGC ACQ ACC ACC CAC ACC 
Trp Leu Pro Asn Pro Asn Leu Lys Ala Glu Arg Thr Thr Thr His Thr 
530 535 540 

CTC TCT CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG 
Leu Ser Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu 
545 550 555 560 

TAT CAA AGC AAT TAC CGC AAT TTC CTG TCT GAA GAG CAG AAG CTG ACC 
Tyr Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr 
565 570 575 

ACC AGC GGC GAT GTC AGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG 
Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 585 590 

TGT AGC AAT CCT TAT TCC GAA AAA CTG GAA TGG CAG ATG CAA AAT ATC 
Cys Ser Asn Pro Tyr Ser Glu Lys Leu Glu Trp Gin Met Gin Asn lie 
595 SOO 60S 

GAC AAG GCC AGA ATC CGC GGT ATC GAG CTG ACG GGC CGT CTG AAT GTG 
Asp Lys Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val 
610 615 620 

GAC AAA GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA CTG TTC GGC TCG 
Asp Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser 
625 630 635 640 

CTG GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC 
Leu Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser 
645 650 655 

ACC CAG CCG TTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC 
Thr Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser 
660 665 670 

GAA AAA TGG GGC GTG TTC TCC CGC CTG ACC TAT CTG GGC GCG AAA AAG 
Glu Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys 
675 680 685 

GTC AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG 2112 
Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 
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CCT TTG CAG AAA AAG GTA AAA GAT TAG CCG TGG CTG AAC AAG TCG GCT 2160 
Pro Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala 
7 05 710 715 720 

TAT GTG TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACT 22 08 

Tyr Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr 
725 730 735 

TTG CGT GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG 2256 
Leu Arg Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp 
740 745 750 

GAT TCC CTG CGC GGC CTG TAT AGC TAC AGC ACC ACC AAC TCG GTC GAC 2 304 

Asp Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ser Val Asp 
755 760 765 

CGC GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA AGC CGT AAT TAC 2352 
Arg Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Ser Arg Asn Tyr 
770 775 780 

GCC GTA TCG CTG GAA TGG AAG TTT TAA 2379 
Ala Val Ser Leu Glu Trp Lys Phe * 
785 790 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 792 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
1 5 10 15 

Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser lie Asp Gly Val Asn Leu Pro Asp 
100 105 110 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Asp lie Val Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 
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Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
iS5 200 



205 



Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 23S S 24 o 

Ser Ala Arg Gly lie Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

Leu Gly Lys lie Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tvr 
275 280 285 

Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 

310 315 320 

Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

Val Asn Tyr Lys Gly Ser Phe Pro He Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu He Tyr Asn Arg Ser Met 
355 360 365 

Asp Thr Arg Phe Lys Arg He Thr Leu Arg Leu Asp Ser His Pro Leu 

375 3 8 o 

Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
5 390 395 400 

Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glv 
405 410 415 7 

Arg Val Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr 
420 425 430 

Asn Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe 
435 440 445 

Ser Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
450 4 55 460 

Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
465 470 475 4 8 o 

Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
485 49o 495 

Asn Gin Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val 
500 505 5 io 
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Pro Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn 
515 520 525 

Trp Leu Pro Asn Pro Asn Leu Lys Ala Glu Arg Thr Thr Thr His Thr 
530 535 540 

Leu Ser Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu 
545 550 555 560 

Tyr Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr 
565 570 575 

Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 585 590 

Cys Ser Asn Pro Tyr Ser Glu Lys Leu Glu Trp Gin Met Gin Asn lie 
595 600 605 

Asp Lys Ala Arg lie Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val 
610 615 620 

Asp Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser 
625 630 635 640 

Leu Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser 
645 650 655 

Thr Gin Pro Leu Lys Val lie Ala Gly lie Asp Tyr Glu Ser Pro Ser 
660 665 670 

Glu Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys 
675 680 685 

Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 

Pro Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala 
705 710 715 720 

Tyr Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr 
725 730 735 

Leu Arg Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp 
740 745 750 

Asp Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ser Val Asp 
755 760 765 

Arg Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Ser Arg Asn Tyr 
770 775 780 

Ala Val Ser Leu Glu Trp Lys Phe 
785 790 



(2) INFORMATION FOR SEQ ID NO: 7: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2378 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 1..2373 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATG AAA CCA TTA CAC ATG CTT CCT ATT GCC GCG CTG GTC GGC AGT ATT 4 8 

Met Lys Pro Leu His Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
1 5 10 15 

TTC GGC AAT CCG GTC TTG GCA GCG GAT GAA GCT GCA ACC GAA ACC ACA 96 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAA GCA GAG ATA AAA GAA GTG CGC GTT AAA GAC CAG CTT AAT 144 
Pro Val Lys Ala Glu lie Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

GCG CCT GCA ACC GTG GAA CGT GTC AAC CTC GGC CGC ATT CAA CAG GAA 192 
Ala Pro Ala Thr Val Glu Arg Val Asn Leu Gly Arg lie Gin Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGT TAC TCC ACC GAC GTC GGC 24 0 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAT AGC GGC CGC CAT CAA AAA GGC TTT GCT GTG CGC GGC GTG 28 8 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGT GTC AGC ATT GAC GGC GTG AGC CTG CCT GAT 336 
Glu Gly Asn Arg Val Gly Val Ser lie Asp Gly Val Ser Leu Pro Asp 
100 105 110 

TCG GAA GAA AAC TCA CTG TAT GCA CGT TAT GGC AAC TTC AAC AGC TCG 384 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGC CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAA ATC GCG AAG 432 
Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Glu lie Ala Lys 
130 135 140 

GGC GCT GAC TCT TTC AAT ACC GGT AGC GGC GCA TTG GGT GGC GGC GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CAT GAT TTG CTG TTG GAC GAC AGG CAA 528 
Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC AGC CGC AAC CGC GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACA CTC GGT TTC GGT GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGT CGC GGT CAT GAG ACC GAA AGC GCG GGC GAG 6 72 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

CGT GGC TAT CCG GTA GAG GGT GCT GGC AGC GGA GCA ATT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala lie lie Arg Gly 
225 230 235 240 

TCG TCA CGC GGT ATC CCT GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 768 
Ser Ser Arg Gly lie Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 
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TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAG CAC CGC ATC GGC CCA 816 
Leu Gly Lys lie Ala Tyr Gin He Asn Asp Lys His Arg He Gly Pro 
260 265 270 

TCG TTT AAC GGC CAG CAG GGG CAT AAT TAC ACG ATT GAA GAG TCT TAT 864 
Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr He Glu Glu Ser Tyr 
275 280 285 

AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AGA CGG 912 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Aro Aro 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ACG CCT GAT TCA AAT TGG CTG 96 0 

Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
305 310 315 320 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG ACA ACC AAA GTG GCG GCG 1008 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 335 

GTT AAC AAC AAA GGC TCG TTC CCG ACG GAT TAT TCC ACC TGG ACG CGC 1056 
Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Arg 
340 345 350 

AAC TAT AAT CAG AAG GAT TTG GAG AAT ATA TAC AAC CGC AGC ATG GAC 1104 
Asn Tyr Asn Gin Lys Asp Leu Glu Asn He Tyr Asn Arg Ser Met Asp 
355 360 365 

ACC CGA TTC AAA CGT TTT ACT TTG CGT ATG GAC AGC CAA CCG TTG CAA 1152 
Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 380 

CTG GGC GGC CAA CAT CGC TTG TCG CTT AAA ACT TTC GCC AGT CGG CGT 
Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Aro Arc? 
385 390 395 y 400 

GAG TTT GAA AAC TTA AAC CGC GAC GAT TAT TAC TTC AGC GAA AGA GTA 1248 
Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 415 

TCC CGT ACT ACC AGC TCG ATT CAA CAC CCC GTG AAA ACC ACT AAT TAT 12 96 

Ser Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAT CAA ATC CAA TGG AAC GAC GTG TTC AGC AGC 1344 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGT GCA GAT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG GAA TTG 13 92 

Arg Ala Asp He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

AAT GCC GAG. TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAT ACT 144 0 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGA TTT GTC GGT TTG GCG GCG CAA CTG AAT CAG 14 88 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CAT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 1536 
Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT TGG CTG 1584 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 
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CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 
Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG TAT CAA 
Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 

AAC AAT TAC CGC AAC TTC TTG TCT GAA GAG CAG AAG CTG ACC ACC AGC 
Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

GGC GAT GTC GGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG TGT AGC 
Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cys Ser 
580 585 59o 

AAT CCT TAT TCC GAA AAA CCG GAA TGG CAG ATG CAA AAT ATC GAT AAG 
Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
595 600 6 05 

GCC CGA ATC CGT GGT CTT GAG CTG ACA GGC CGT CTG AAT GTG ACA AAA 
Ala Arg He Arg Gly Leu Glu Leu Thr Gly Arg Leu Asn Val Thr Lys 
610 615 620 

GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA TTG TTC GGC TCG CTG GGT 
Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
625 630 635 6 40 

TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA CAG 
Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

CCG CCG AAA GTG ATT GCC GGT GTC GAC TAC GAA AGC CCG AGC GAA AAA 
Pro Pro Lys Val He Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

TGG GGT GTG TTC TCC CGC CTG ACT TAT CTG GGT GCG AAA AAG GCC AAA 
Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lys 
675 680 685 

GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC CGG GGT ACG CCT TTG 

ASP £ii Gln ^ Thr Val ^ Glu Asn L V S G1 Y Arg Gly Thr Pro Leu 
690 695 700 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 
Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

TTT GAT ATG TAC GGC TTC TAC AAA CTG GCT AAA AAC CTG ACT TTG CGT 
Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arg 
725 730 735 

GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG GAT TCC 
Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
7 *0 745 750 

CTG CGC GGT TTG TAT AGC TAC AGC ACC ACC AAC GCG GTC GAC CGA GAT 
Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 

GGC AAA GGC TTA GAC CGC TAC CGC GCC TCA GGC CGT AAT TAC GCC GTA 
Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 
770 775 780 

TCG CTG GAT TGG AAG TTT TGA ATTCC 
Ser Leu Asp Trp Lys Phe * 
785 790 
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(2) INFORMATION FOR SEQ ID NO ; 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Lys Pro Leu His Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
1 5 riO 15 

Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu lie Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

Ala Pro Ala Thr Val Glu Arg Val Asn -Leu Gly Arg He Gin Gin Glu 
50 55 60 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Ser Leu Pro Asp 
100 105 110 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Ala Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
14 * 150 155 160 

Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala He He Arg Gly 
225 2 30 235 240 

Ser Ser Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Lys His Arg He Gly Pro 
260 265 270 

Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr He Glu Glu Ser Tyr 
275 280 285 

Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 
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Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
305 310 31S 

Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 335 

Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Ara 
340 345 350 

Asn Tyr Asn Gin Lys Asp Leu Glu Asn He Tyr Asn Arg Ser Met Asp 
355 360 365 

Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 380 

Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Arg Arg 
385 390 395 * 40 5 

Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 415 

Ser Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tvr 
420 425 430 

Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

Arg Ala Asp He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 S75 

Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cvs Ser 
580 585 59o 

Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
59 5 600 605 

^ff 116 AX9 Gly Leu Glu Leu Thr G1 y ATS Leu Asn Val Thr Lys 
610 615 620 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
625 630 635 640 

Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
64S 650 655 
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Pro Pro Lys Val He Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lvs 
G 7 * 680 685 

Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Arg Gly Thr Pro Leu 
690 695 700 

Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arq 
72 5 730 735 

Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
75 5 760 765 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 
770 775 780 

Ser Leu Asp Trp Lys Phe 
785 790 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Gin Gin Gin His Leu Phe Arg Leu Asn He Leu Cys Leu Ser Leu 
1 5 !0 



15 



Met Thr Ala Leu Pro Val Tyr Ala Glu Asn Val Gin Ala Glu Gin Ala 
20 25 30 

Gin Glu Lys Gin Leu Asp Thr He Val Lys Ala Lys Lys Gin Lys Thr 
35 40 45 

Arg Arg Asp Asn Glu Val Thr Gly Leu Gly Lys Leu Val Lys Ser Ser 
50 55 60 

Asp Thr Leu Ser Lys Glu Gin Val Leu Asn He Arg Asp Leu Thr Arg 
65 7 0 75 80 

Tyr Asp Pro Gly He Ala Val Val Glu Gin Gly Arg Gly Ala Ser Ser 
85 90 95 

Gly Tyr Ser He Arg Gly Met Asp Lys Asn Arg Val Ser Leu Thr Val 
1°0 105 no 

Asp Gly Val Ser Gin He Gin Ser Tyr Thr Ala Gin Ala Ala Leu Gly 
H5 120 125 

Gly Thr Arg Thr Ala Gly Ser Ser Gly Ala He Asn Glu He Glu Tyr 
130 135 140 
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Glu Asn Val Lys Ala Val Glu lie Ser Lys Gly Ser Asn Ser Ser Glu 
145 150 155 160 

Tyr Gly Asn Gly Ala Leu Ala Gly Ser Val Ala Phe Gin Thr Lys Thr 

165 170 175 

Ala Ala Asp lie lie Gly Glu Gly Lys Gin Trp Gly He Gin Ser Lys 

iso IBs 190 y 

Thr Ala Tyr Ser Gly Lys Asp His Ala Leu Thr Gin Ser Leu Ala Leu 
195 200 205 

Ala Gly Arg Ser Gly Gly Ala Glu Ala Leu Leu lie Tyr Thr Lys Arg 

Arg Gly Arg Glu lie His Ala His Lys Asp Ala Gly Lys Gly Val Gin 
-" 5 2 30 235 240 

Ser Phe Asn Arg Leu Pro He Cys Arg Phe Gly Asn Asn Thr Tyr Thr 
24 5 250 255 

Asp Cys Thr Pro Arg Asn He Gly Gly Asn Gly Tyr Tyr Ala Ala Val 
260 265 2 7o 

Gin Asp Asn Val Arg Leu Gly Arg Trp Ala Asp Val Gly Ala Gly He 
275 280 285 

Arg Tyr Asp Tyr Arg Ser Thr His Ser Glu Asp Lys ser Val Ser Thr 
290 295 300 

Gly Thr His Arg Asn Leu Ser Trp Asn Ala Gly Val Val Leu Lys Pro 
305 310 315 320 

Phe Thr Trp Met Asp Leu Thr Tyr Arg Ala Ser Thr Gly Phe Arg Leu 
325 330 335 

Pro Ser Phe Ala Glu Met Tyr Gly Trp Arg Ala Gly Glu Ser Leu Lys 
340 345 3 5 o 

Thr Leu Asp Leu Lys Pro Glu Lys Ser Phe Asn Arg Glu Ala Gly He 
355 360 355 

Val Phe Lys Gly Asp Phe Gly Asn Leu Glu Ala Ser Tyr Phe Asn Asn 
370 375 380 

Ala Tyr Arg Asp Leu He Ala Phe Gly Tyr Glu Thr Arg Thr Gin Asn 
385 390 395 400 

Gly Gin Thr Ser Ala Ser Gly Asp Pro Gly Tyr Arg Asn Ala Gin Asn 
4 °5 410 415 

Ala Arg He Ala Gly He Asn He Leu Gly Lys He Asp Trp His Gly 
420 425 430 

Val Trp Gly Gly Leu Pro Asp Gly Leu Tyr Ser Thr Leu Ala Tyr Asn 
435 440 445 

^9 Va * Asp Ala Asp Arg Ala Asp Arg Thr Phe Val Thr 

450 455 460 

Ser Tyr Leu Phe Asp Ala Val Gin Pro Ser Arg Tyr Val Leu Gly Leu 
465 470 475 480 

Gly Tyr Asp His Pro Asp Gly He Trp Gly He Asn Thr Met Phe Thr 
4 85 490 495 

Tyr Ser Lys Ala Lys Ser Val Asp Glu Leu Leu Gly Ser Gin Ala Leu 
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500 505 510 

Leu Asn Gly Asn Ala Asn Ala Lys Lys Ala Ala Ser Arg Arg Thr Arg 
515 520 525 

Pro Trp Tyr Val Thr Asp Val Ser Gly Tyr Tyr Asn He Lys Lys His 
530 535 540 

Leu Thr Leu Arg Ala Gly Val Tyr Asn Leu Leu Asn Tyr Arg Tyr Val 
545 550 555 560 

Thr Trp Glu Asn Val Arg Gin Thr Ala Gly Gly Ala Val Asn Gin His 
565 570 575 

Lys Asn Val Gly Val Tyr Asn Arg Tyr Ala Ala Pro Gly Arg Asn Tyr 
580 585 590 

Thr Phe Ser Leu Glu Met Lys Phe 
595 600 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Met Asn Lys Lys His Gly Phe Gin Leu Thr Leu Thr Ala Leu Ala Val 
15 10 15 

Ala Ala Ala Phe Pro Ser Tyr Ala Ala Asn Pro Glu Thr Ala Ala Pro 
20 25 30 

Asp Ala Ala Gin Thr Gin Ser Leu Lys Glu Val Thr Val Arg Ala Ala 
35 40 45 

Lys Val Gly Arg Arg Ser Lys Glu Ala Thr Gly Leu Gly Lys lie Ala 
50 55 60 

Lys Thr Ser Glu Thr Leu Asn Lys Glu Gin Val Leu Gly lie Arg Asp 
65 70 75 80 

Leu Thr Arg Tyr Asp Pro Gly Val Ala Val Val Glu Gin Gly Asn Gly 
85 90 95 

Ala Ser Gly Gly Tyr Ser He Arg Gly Val Asp Lys Asn Arg Val Ala 
100 105 110 

Val Ser Val Asp Gly Val Ala Gin He Gin Ala Phe Thr Val Gin Gly 
115 120 125 

Ser Leu Ser Gly Tyr Gly Gly Arg Gly Gly Ser Gly Ala He Asn Glu 
130 135 140 

He Glu Tyr Glu Asn He Ser Thr Val Glu He Asp Lys Gly Ala Gly 
145 150 155 160 

Ser Ser Asp His Gly Ser Gly Ala Leu Gly Gly Ala Val Ala Phe Arg 
165 170 175 
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Thr Lys Glu Ala Ala Asp Leu lie Ser Asp Gly Lys Ser Trp Gly He 
180 185 190 

Gin Ala Lys Thr Ala Tyr Gly Ser Lys Asn Arg Gin Phe Met Lys Ser 
195 200 205 

Leu Gly Ala Gly Phe Ser Lys Asp Gly Trp Glu Gly Leu Leu He Arg 
210 215 220 

Thr Glu Arg Gin Gly Arg Glu Thr His Pro His Gly Asp He Ala Asp 
225 230 235 240 

Gly Val Ala Tyr Gly He Asn Arg Leu Ser Val Cys Gly Tyr He Glu 
245 250 255 

Thr Leu Arg Ser Arg Lys Cys Val Pro Arg Lys lie Asn Gly Ser Asn 
260 265 210 

He His He Ser Leu Asn Asp Arg Phe Ser He Gly Lys Tyr Phe Asp 
275 280 285 

Phe Ser Leu Gly Gly Arg Tyr Asp Arg Lys Asn Phe Thr Thr Ser Glu 
290 295 300 

Glu Leu Val Arg Ser Gly Arg Tyr Val Asp Arg Ser Trp Asn Ser Gly 
305 310 315 320 

He Val Phe Lys Pro Asn Arg His Phe Ser Leu Ser Tyr Arg Ala Ser 
325 330 335 

Ser Gly Phe Arg Thr Pro Ser Phe Gin Glu Leu Phe Gly lie Asp He 
340 345 350 

Tyr His Asp Tyr Pro Lys Gly Trp Gin Arg Pro Ala Leu Lys Ser Glu 
355 360 365 

Lys Ala Ala Asn Arg Glu He Gly Leu Gin Trp Lys Gly Asp Phe Gly 
370 375 380 

Phe Leu Glu He Ser Ser Phe Arg Asn Arg Tyr Thr Asp Met He Ala 
385 390 



395 400 



Val Ala Asp His Lys Thr Lys Leu Pro Asn Gin Ala Gly Gin Leu Thr 
405 410 415 

Glu lie Asp He Arg Asp Tyr Tyr Asn Ala Gin Asn Met Ser Leu Gin 
420 425 43 0 

Gly Val Asn He Leu Gly Lys He Asp Trp Asn Gly Val Tyr Gly Lvs 
43S 440 445 

Leu Pro Glu Gly Leu Tyr Thr Thr Leu Ala Tyr Asn Arg He Lys Pro 
450 455 460 

Lys Ser Val Ser Asn Arg Pro Gly Leu Ser Leu Arg Ser Tyr Ala Leu 
465 470 475 480 

Asp Ala Val Gin Pro Ser Arg Tyr Val Leu Gly Phe Gly Tyr Asp Gin 
485 490 495 

Pro Glu Gly Lys Trp Gly Ala Asn He Met Leu Thr Tyr Ser Lys Gly 
500 505 510 

Lys Asn Pro Asp Glu Leu, Ala Tyr Leu Ala Gly Asp Gin Lys Arg Tyr 
515 520 525 
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Ser Thr Lys Arg Ala Ser Ser Ser Trp Ser Thr Ala Asp Val Ser Ala 
530 535 540 

Tyr Leu Asn Leu Lys Lys Arg Leu Thr Leu Arg Ala Ala lie Tyr Asn 
545 550 555 560 

He Gly Asn Tyr Arg Tyr Val Thr Trp Glu Ser Leu Arg Gin Thr Ala 
565 570 575 

Glu Ser Thr Ala Asn Arg His Gly Gly Asp Ser Asn Tyr Gly Arg Tyr 
580 585 590 

Ala Ala Pro Gly Arg Asn Phe Ser Leu Ala Leu Glu Met Lys Phe 
595 600 605 



<2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AAACAGGTCT CGGCATAG 18 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGCGAATTCA AACAGGTCTC GG CAT AG 2 7 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGCGAATTCA AAAACTTCCA TTCCAGCGAT ACG 33 

(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TAAAACTTCC ATTCCAGCGA TACG 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
AAACAGGTCT CGGCATAG 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGCGAATTCA AACAGGTCTC GGCATAG 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGCGAATTCA AAAACTTCCA TTCCAGCGAT ACG 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
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<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TAAAACTTCC ATTCCAGCGA TACG 
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WHAT WE CLAIM IS: 

1. An isolated and purified recombinant nucleic acid encoding a 
hemoglobin receptor protein from a Neisseria species. 

2. An isolated and purified recombinant nucleic acid according to Claim 
5 1 , wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 

acid sequence that is the amino acid sequence depicted as Seq. I D. No. 2. 

3 . An isolated and purified recombinant nucleic acid according to Claim 
1, wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. ID. No. 4. 

10 4 - isolated P urif ied recombinant nucleic acid according to Claim 

1, wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 6. 

5. An isolated and purified recombinant nucleic acid according to Claim 
1, wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 8. 

6. A homogeneous preparation of a hemoglobin receptor protein from a 
Neisseria species. 

7. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq ID No 

20 2. 

8. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 



15 



25 



8 

30 11 



4 

9. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 

6. 

10. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 



A recombinant expression construct comprising a nucleic acid that 
encodes a hemoglobin receptor protein from a Neisseria species. 
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12. A transformed cell culture comprising the recombinant expression 
construct of Claim 1 1 . 

13. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 

5 sequence that is the amino acid sequence depicted as Seq. I.D. No. 2. 

14. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 
sequence that is the amino acid sequence depicted as Seq. I.D. No. 4. 

15. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 
sequence that is the amino acid sequence depicted as Seq. I.D. No. 6. 

16. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 
sequence that is the amino acid sequence depicted as Seq. I.D. No. 8. 

15 17. A transformed cell culture comprising the recombinant expression 

construct of Claims 13, 14, 15 or 16. 

18. An antibody or antigen-binding fragment thereof that is 

immunologically reactive with an antigenic epitope of a hemoglobin receptor protein 

from a Neisseria species. 
20 l^. An antibody according to Claim 18 that is a monoclonal antibody. 

20. An antibody or antigen-binding fragment thereof according to Claim 
18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 2. 

21. An antibody or antigen-binding fragment thereof according to Claim 
25 18 that is immunologically reactive with an antigenic epitope of the hemoglobin 

receptor protein depicted as Seq. I.D. No. 4. 

22. An antibody or antigen-binding fragment thereof according to Claim 
18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 6. 

30 23. An antibody or antigen-binding fragment thereof according to Claim 

18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 8. 



10 
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24. An antigenic epitope of a hemoglobin receptor protein from a 
Neisseria species. 

25. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 2. 

26. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 4. 

27. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 6. 

28. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 8. 

29. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising an antibody according to Claims 18, 20, 21, 22, or 23. 

30. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising an antibody according to Claim 19. 

31. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising the nucleic acid of Claim 1 . 

32. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising the nucleic acid of Claims 2, 3, 4 or 5. 

33. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising an antibody according to Claim 18, 20, 21, 22, or 23. 

34. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising an antibody according to Claim 19. 

35. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising the nucleic acid of Claim 1 or antisense homologue thereof. 



- 78 - 



96/12020 




PCT/US9S/13623 



36. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising the nucleic acid of Claims 2, 3, 4, or 5 or antisense homologue thereof. 

37. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising the recombinant expression construct of Claims 11, 13, 14, 15 or 16 or 
a homologue thereof that expresses the nucleic acid encoding a hemoglobin receptor 
in an antisense orientation. 

38. An antibody according to Claims 20, 21 , 22 or 23 that is a monoclonal 
antibody. 

39. A cell line that produces the monoclonal antibody of Claims 19 or 38. 

40. A method of treating a disease in a human caused by bacteria of a 
Neisseria species, the method comprising the step of administering a therapeutically- 
effective amount of the therapeutic agent of Claims 33, 34, 35, 36, or 37 in a 
phannaceutically-acceptable carrier. 

41. A method of diagnosing a disease in a human caused by bacteria of 
a Neisseria species, the method comprising the steps of contacting an amount of a 
detectably-labeled diagnostic reagent of Claims 29, 30, 31, or 32 to a biological 
sample from the human under conditions wherein the diagnostic reagent specifically 
binds to the Neisseria bacteria and detecting an amount of the specific binding to the 
biological sample. 

42. A vaccine that is effective in providing immunization against infection 
of a human with a bacteria of Neisseria species comprising a hemoglobin binding 
protein or antigenic fragment thereof. 

43. The vaccine of Claim 42 comprising the hemoglobin receptor protein 
of Claims 6, 7, 8, 9, or 10. 

44. The vaccine of Claim 42 comprising a nucleic acid encoding a 
hemoglobin receptor protein from a Neisseria species or antigenic fragment thereof. 

45. A vaccine according to Claim 44 comprising the nucleic acid of 
Claims 2, 3, 4, 5, 11, 13, 14, 15, or 16. 

46. The vaccine of Claim 42 comprising cells of the transformed cell 
culture of Claim 17. 
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47. A vaccine according to Claim 46 wherein the cells are attenuated 
bacterial cells. 

48. A vaccine according to Claim 47 wherein the ceils are Salmonella 

cells. 

5 49. The vaccine of Claim 42 comprising the epitope of the hemoglobin 

receptor protein of Claims 24, 25, 26, 27 or 28. 
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LBPA WKXXG FOLTLTALA VAAAT P SYAAKF ETAA POAAXTTV* ^^UXOAA^ 30 



LtfA -qm S KEA TTOCTT AA1 AH U OCgQVLCniPL m < UFC VAW 1Q OOASC 99 

HKB* KGffDU»»AAVPV*«iaJBaKO«HIKaf|^VKYSTP W gnSCB HQK- tt 



TBPlM CYS I HGKDKHUVSLTVPCVSQ I QSYT AO AA I /yTTKT AG S SCAINt I my 14? 
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Figure 7 



ATG AAA CCA TTA CAA ATG CCC CCT ATC GCC GCG CTG CTC GGC AGT ATT 4 8 

Met Lys Pro Leu Gin Met Pro Pro lie Ala Ala Leu Leu Gly Ser lie 
1 5 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGT CAG CGC AAT 144 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAT AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 24 0 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGG AGC CGT CAT CAA AAA GGC TTT GCC ATT CGC GGC GTG 2 88 

Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala lie Arg Gly Val 
85 90 95 

GAA GGC GAC CGT GTC GGC GTT AGT ATT GAC GGC GTA AAC CTG CCT GAT 33 6 

Glu Gly Asp Arg Val Gly Val Ser lie Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 384 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 43 2 

Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Asp lie Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 48 0 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 52 8 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 72 0 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn lie Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 76 8 

Ser Ala Arg Gly lie Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys lie Ala Tyr Gin lie Asn Asp Asn His Arg lie Gly Ala 
260 265 270 
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Figure 7 (cont'd.) 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 8 64 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 96 0 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACC 1056 
Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 350 

GAG TAC CAT AAA AAG GAA GTT GGC GAA ATC TAT AAC CGC AGC ATG GAT 1104 
Glu Tyr His Lys Lys Glu Val Gly Glu lie Tyr Asn Arg Ser Met Asp 
355 360 365 

ACA ACC TTC AAA CGT ATT ACG CTG CGT ATG GAC AGC CAT CCG TTG CAA 115 2 

Thr Thr Phe Lys Arg lie Thr Leu Arg Met Asp Ser His Pro Leu Gin 
370 375 380 

CTC GGG GGG GGG CGA CAC CGC CTG TCG TTC AAA ACC TTT GCC GGG CAG 12 00 

Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Gly Gin 
385 390 395 400 

CGT GAT TTT GAA AAC TTA AAC CGC GAC GAT TAC TAC TTC AGC GGC CGT ' 124 8 
Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg 
405 410 415 

GTT GTT CGA ACC ACC AAC AGT ATC CAG CAT CCG GTG AAA ACC ACC AAC 12 96 

Val Val Arg Thr Thr Asn Ser He Gin His Pro Val Lys Thr Thr Asn 
420 425 430 

TAC GGT TTC TCG CTG TCC GAC CAA ATC CAA TGG AAC GAC GTG TTC AGT " 1344 
Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser 
435 440 445 

AGC CGC GCA GGT ATC CGT TAC GAC CAC ACC AAA ATG ACG CCT CAG GAA 13 92 

Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu 
450 455 460 

TTG AAT GCC GAC TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAC 1440 
Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn 
465 470 475 480 

ACT TAT AAA GGC TGG AGC GGA TTT GTC GGC TTG GCG GCG CAG CTG AGC 1488 
Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Ser 
485 490 495 

CAA ACA TGG CGT TTG GGT TAC GAT GTG ACC TCA GGT TTC CGC GTG CCG 1536 
Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arg Val Pro 
500 505 510 

AAT GCG TCT GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGC ACT TGG 1584 
Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Thr Trp 
515 520 525 
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Figure 7 (cont'd.) 

AAG CCT AAT CCT AAT TTG AAG GCA GAA CGC AGC ACC ACC CAC ACC CTG 163 2 

Lys Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu 
530 535 540 

TCC TTG CAG GGG CGC GGC GAC AAA GGG ACA CTG GAT GCC AAC CTG TAT 168 0 

Ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Ala Asn Leu Tyr 
545 550 555 560 

CAA AGC AAT TAC CGA AAC TTC CTG TCG GAA GAG CAG AAT CTG ACT GTC 172 8 

Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 
565 570 575 

AGC GGC ACA CCC GGC TGT ACT GAG GAG GAT GCT TAC TAC TAT AGA TGC 17 76 

Ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr Aro Cvs 
580 585 590 

AGC GAC CCC TAC AAA GAA AAA CTG GAT TGG CAG ATG AAA AAT ATC GAC 182 4 

Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn lie Asp 
595 600 605 

AAG GCC AGA ATC CGC GGT ATC GAG TTG ACA GGC CGT CTG AAT GTG GAC 18 72 

Lys Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp 
610 615 620 

AAA GTA GCG TCT TTT GTT CCT GAG GGT TGG AAA CTG TTC GGC TCG CTG 192 0 

Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu 
625 630 635 640 

GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA 1968 
Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr 
«45 650 655 

CAG CCG CTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC GAA 2 016 

Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu 
660 665 670 

AAA TGG GGC GTA TTC TCC CGC CTG ACC TAT CTA GGC GCG AAA AAG GTC 2 064 

Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val 
675 680 685 

AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG CCT 2112 
Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 
690 695 700 

TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT 216 0 

Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 
70S 710 715 720 

GTG TTT GAT ATG TAC GGC TTC TAC AAA CCG GCT AAA AAC CTG ACT TTG 2208 
Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 
725 730 735 

CGT GCA GGC GTG TAC AAC CTG TTC AAC CGC AAA TAC ACC ACT TGG GAT 2256 
Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp 
740 745 750 

TCC CTG CGC GGT TTA TAT AGC TAC AGC ACC ACC AAT GCG GTC GAC CGC 2304 
Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg 
755 760 765 

GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA GGC CGC AAT TAC GCC 2352 
Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala 
770 775 780 
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Figure 7 (cont'd.) 

GTA TCG CTG GAA TGG AAG TTT TAA 
Val Ser Leu Glu Trp Lys Phe * 
785 790 
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Figure 8 

ATG AAA CCA TTA CAA ATG CTC CCT ATC GCC GCG CTG GTC GGC AGT ATT 
Met Lys Pro Leu Gin Met Leu Pro He Ala Ala Leu Val Gly Ser He 
1 5 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGC CAG CGC AAT 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 
Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGC GTA AAC CTG CCT GAT 
Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 
Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 
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Figure 8 (cont.'d). 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys lie Ala Tyr Gin lie Asn Asp Asn His Arg lie Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 8 64 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arq Arq 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 96 0 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arq Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 100 8 

Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ATA GAG GAT TCT TCC ACC TTG ACA 1056 
Val Asn Tyr Lys Gly Ser Phe Pro lie Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

CGT AAC TAC AAT CAA AAG GAC TTG GAT GAA. ATC TAC AAC CGC AGT ATG 1104 
Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu lie Tyr Asn Arg Ser Met 
355 360 365 

GAT ACC CGC TTC AAA CGC ATT ACC CTG CGT TTG GAC AGC CAT CCG TTG 1152 
Asp Thr Arg Phe Lys Arg lie Thr Leu Arg Leu Asp Ser His Pro Leu 
370 375 380 

CAA CTC GGG GGG GGG CGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC 12 00 

Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
385 390 395 400 

CGC CGT GAT TTT GAA AAC CTA AAC CGC GAC GAT TAT TAC TTC AGC GGC 12 4 8 

Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly 
405 410 415 

CGT GTT GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC 12 96 

Arg Val Val Arg Thr Thr Ser Ser lie Gin His Pro Val Lys Thr Thr - 
420 425 430 

AAC TAC GGT TTC TCA CTG TCT GAC CAA ATT CAA TGG AAC GAC GTG TTC 1344 
Asn Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe 
435 440 445 

AGT AGC CGC GCA GGT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG 13 92 

Ser Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
. 450 455 460 

GAA TTG AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC 144 0 

Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
465 470 475 480 

AAC ACT TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG 148 8 

Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
485 490 495 

AAT CAG GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC 1536 
Asn Gin Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val 
500 505 510 
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Figure 8 (cont^.) 

CCC AAT GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT 158 4 

Pro Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn 
515 520 525 

TGG CTG CCC AAT CCC AAC CTG AAA GCC GAG CGC ACG ACC ACC CAC ACC 16 3 2 

Trp Leu Pro Asn Pro Asn Leu Lys Ala Glu Arg Thr Thr Thr His Thr 
530 535 540 

CTC TCT CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG 1680 
Leu Ser Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu 
545 550 555 560 

TAT CAA AGC AAT TAC CGC AAT TTC CTG TCT GAA GAG CAG AAG CTG ACC 172 8 

Tyr Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr 
565 570 575 

ACC AGC GGC GAT GTC AGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG 1776 
Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 585 590 

TGT AGC AAT CCT TAT TCC GAA AAA CTG GAA TGG CAG ATG CAA AAT ATC 18 24 

Cys Ser Asn Pro Tyr Ser Glu Lys Leu Glu Trp Gin Met Gin Asn lie 
595 600 605 

GAC AAG GCC AGA ATC CGC GGT ATC GAG CTG ACG GGC CGT CTG AAT GTG 18 72 

Asp Lys Ala Arg lie Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val 
610 615 620 

GAC AAA GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA CTG TTC GGC TCG 192 0 

Asp Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser 
625 630 635 640 

CTG GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC 196 8 

Leu Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser 
645 650 655 

ACC CAG .CCG TTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC 2 016 

Thr Gin Pro Leu Lys Val lie Ala Gly lie Asp Tyr Glu Ser Pro Ser 
660 665 670 

GAA AAA TGG GGC GTG TTC TCC CGC CTG ACC TAT CTG GGC GCG AAA AAG 2 06 4 

Glu Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys 
675 680 685 

GTC AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG 2112 
Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 

CCT TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT 2160 
Pro Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala 
705 710 715 720 

TAT GTG TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACT 22 08 

Tyr Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr 
725 730 735 

TTG CGT GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG 2256 
Leu Arg Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp 
740 745 750 

GAT TCC CTG CGC GGC CTG TAT AGC TAC AGC ACC ACC AAC TCG GTC GAC 2 3 04 

Asp Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ser Val Asp 
755 760 765 
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Figure 8 (cont'd. ) 

CGC GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA AGC CGT AAT TAC 
Arg Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Ser Arg^sn 5£r 
770 775 780 

GCC GTA TCG CTG GAA TGG AAG TTT TAA 
Ala Val Ser Leu Glu Trp Lys Phe * 
785 790 
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Figure 9 

ATG AAA CCA TTA CAC ATG CTT CCT ATT GCC GCG CTG GTC GGC AGT ATT 4 8 

Met Lys Pro Leu His Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
15 io 15 

TTC GGC AAT CCG GTC TTG GCA GCG GAT GAA GCT GCA ACC GAA ACC ACA 96 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAA GCA GAG ATA AAA GAA GTG CGC GTT AAA GAC CAG CTT AAT 14 4 

Pro Val Lys Ala Glu lie Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

GCG CCT GCA ACC GTG GAA CGT GTC AAC CTC GGC CGC ATT CAA CAG GAA 192 
Ala Pro Ala Thr Val Glu Arg Val Asn Leu Gly Arg lie Gin Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGT TAC TCC ACC GAC GTC GGC 24 0 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAT AGC GGC CGC CAT CAA AAA GGC TTT GCT GTG CGC GGC GTG 2 88 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGT GTC AGC ATT GAC GGC GTG AGC CTG CCT GAT 3 36 

Glu Gly Asn Arg Val Gly Val Ser lie Asp Gly Val Ser Leu Pro Asp 
100 105 110 

TCG GAA GAA AAC TCA CTG TAT GCA CGT TAT GGC AAC TTC AAC AGC TCG 3 84 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGC CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAA ATC GCG AAG 432 
Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Glu lie Ala Lys 
130 135 140 

GGC GCT GAC TCT TTC AAT ACC GGT AGC GGC GCA TTG GGT GGC GGC GTG 4 80 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CAT GAT TTG CTG TTG GAC GAC AGG CAA 528 
Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC AGC CGC AAC CGC GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACA CTC GGT TTC GGT GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGT CGC GGT CAT GAG ACC GAA AGC GCG GGC GAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

CGT GGC TAT CCG GTA GAG GGT GCT GGC AGC GGA GCA ATT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala lie lie Arg Gly 
225 230 235 240 

TCG TCA CGC GGT ATC CCT GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 768 
Ser Ser Arg Gly lie Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 
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Figure 9 (cont'd.) 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAG CAC CGC ATC GGC CCA 816 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Lys His Arg lie Gly Pro 
260 265 270 

TCG TTT AAC GGC CAG CAG GGG CAT AAT TAC ACG ATT GAA GAG TCT TAT 864 
Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr He Glu Glu Ser Tyr 
275 280 285 

AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AGA CGG 912 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ACG CCT GAT TCA AAT TGG CTG 96 0 

Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
305 310 315 320 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG ACA ACC AAA GTG GCG GCG 1008 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 335 

GTT AAC AAC AAA GGC TCG TTC CCG ACG GAT TAT TCC ACC TGG ACG CGC 1056 
Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Arg 
340 345 350 

AAC TAT AAT CAG AAG GAT TTG GAG AAT ATA TAC AAC CGC AGC ATG GAC 1104 
Asn Tyr Asn Gin Lys Asp Leu Glu Asn He Tyr Asn Arg Ser Met Asp 
355 360 365 

ACC CGA TTC AAA CGT TTT ACT TTG CGT ATG GAC AGC CAA CCG TTG CAA 1152 
Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 380 

CTG GGC GGC CAA CAT CGC TTG TCG CTT AAA ACT TTC GCC AGT CGG CGT " 1200 
Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

GAG TTT GAA AAC TTA AAC CGC GAC GAT TAT TAC TTC AGC GAA AGA GTA 1248 
Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 415 

TCC CGT ACT ACC AGC TCG ATT CAA CAC CCC GTG AAA ACC ACT AAT TAT ' 12 96 
Ser Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAT CAA ATC CAA TGG AAC GAC GTG TTC AGC AGC 1344 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGT GCA GAT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG GAA TTG 13 92 

Arg Ala Asp He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAT ACT 144 0 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGA TTT GTC GGT TTG GCG GCG CAA CTG AAT CAG 1488 
Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CAT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 1536 
Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 
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Figure 9 (cont'd-) 

GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT TGG CTG 1584 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 163 2 

Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG TAT CAA 
Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

AAC AAT TAC CGC AAC TTC TTG TCT GAA GAG CAG AAG CTG ACC ACC AGC 172 8 

Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

GGC GAT GTC GGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG TGT AGC 17 76 

Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cys Ser 
580 585 590 

AAT CCT TAT TCC GAA AAA CCG GAA TGG CAG ATG CAA AAT ATC GAT AAG 1824 
Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
595 600 605 

GCC CGA ATC CGT GGT CTT GAG CTG ACA GGC CGT CTG AAT GTG ACA AAA 18 72 

Ala Arg He Arg Gly Leu Glu Leu Thr Gly Arg Leu Asn Val Thr Lys 
610 615 620 

GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA TTG TTC GGC TCG CTG GGT 192 0 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Glv 
625 630 635 640 

TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA CAG 196 8 

Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
€45 650 655 

CCG CCG AAA GTG ATT GCC GGT GTC GAC TAC GAA AGC CCG AGC GAA AAA 2016 
Pro Pro Lys Val He Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

TGG GGT GTG TTC TCC CGC CTG ACT TAT CTG GGT GCG AAA AAG GCC AAA 2064 
Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lys 
675 680 685 

GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC CGG GGT ACG CCT TTG 2112 
Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Arg Gly Thr Pro Leu 
690 695 700 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 2160 
Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
70S 710 715 720 

TTT GAT ATG TAC GGC TTC TAC AAA CTG GCT AAA AAC CTG ACT TTG CGT 22 08 

Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arg 
725 730 735 

GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG GAT TCC 2256 
Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

CTG CGC GGT TTG TAT AGC TAC AGC ACC ACC AAC GCG GTC GAC CGA GAT 2304 
Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 
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Figure 9 (cont'd.) 

GGC AAA GGC TTA GAC CGC TAG CGC GCC TCA GGC CGT AAT TAG GCC GTA 23 52 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 
770 775 780 



TCG CTG GAT TGG AAG TTT TGA ATTCC 
Ser Leu Asp Trp Lys Phe * 
785 790 
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HMBRA MXP LQML P I AALVGS I FGNP VLAAD EAATETT PVKAE I KAVRVKGQRNAF 50 

HMBRB MKPLQMPPIAALLGSIFGNPVFAXD EAATETT PVKAITVKAVRVKGQRNAP 50 

HMBRC MKPLQMLPIAALVGSIFGNPVTAADEAATETTPVKAEVT^VRVKGQRNAP 50 

HMBRMS11 MK P LHML P I AAL VGS I FGNPVLAADEAATETTPVKAEI KEVRVKDQLNAP 50 

HMBRA AAVERVNLNRI KQEMIRDNKDLVRYSTDVGLSDSGRHQKGFAVRGVEGNR 100 

HMBRB AAVERVNLNRIKQEMIRDNKDLVRYSTDVGLSDRSRHQKGFAIRGVEGDR 100 

HMBRC AAVERVNLNRI KQEMIRDNKDLVRY STDVGLSDSGRHQKGFAVRGVEGNR 100 

HMBRMS1 1 A TVER VNLGR I QQ EM I RDNKDLVRY STDVGLSDSGRHQKGFAVRGVEGNR 100 
* _ ****** * * _ *•****••***♦•****•*»* ^ ^ ******* _ * * * * • ^ * 

HMBRA VGVSIDGVNLPDSEENSLYARYGNFNSSRLSIDPELVRNIEIVKGADSFN 150 

HMBRB VGVS IDGVNLPDSEENSLYARYGNFNSSRLSIDPELVRNIDIVKGADSFN 150 

HMBRC VGVS I DGVNLPDSEENSLYARYGNFNSSRLS I DP ELVRNIDIVKGADS FN 150 

HMBRMS11 VGVS I DGVSLPDSEENSLYARYGNFNSSRLS ID PELVRNIEIAKGADSFN 150 

HMBRA TGSGALGGGV^OTLQGRDLLLDDRQFGVMMKNGYSTRNREWTNTLGFGV 200 

HMBRB TGSGA1X3GGVNYQTLQGRDLLLPERQFGVMMKNGYSTRNREWTNTLGFGV 200 

HMBRC TGSGAUXXrVNYQTLQGRDLLLPERQFGVMMKNGYSTRNKEWTOTLGFW 200 

HMBRMS 1 1 TGSGALGGGV^^V'(^LQGHDLLLDDROFGVMMKNGYSSRNREWTIT^LGFGV 200 

HMBRA SNDRVDAALLYSQRRGHETESAGNRGYPVEGAGKETNIRGSARGI PDPSK 250 

HMBRB SNDRVDAALLYSQRRGHETESAGKRGYPVEGAGSGANIRGSARGIPDPSQ 250 

HMBRC SNDRVDAALL YSQRRGHETES AGKRGYPVEGAGSGANIRGSARGI PDPSQ 250 

HMBRMS 1 1 SNDRVDAALLYSQRRGHETESAGERGYPVEGAGSGAI IRGSSRGI PDPSK 250 

HMBRA HKYHNF LGK I AYQ 1 KDNHR I GAS LNGQQGHNYTVEESYNLTAS5WREADD 300 

HMBRB HKYHSFI^KIAYQINDNHRIGASLNGQQGHNYTVEESYNLI-ASYWREADD 300 

HMBRC HKYHSFLGKIAYQINDNHRIGASLNGQQGHNYTVEESYNL^ 300 

HMBRMS 11 HKYHNF LGKIAYQINDKHRIGPSFNGQOGHNYTIEESYNLTASSWREADD 300 

HMBRA VNRRJWANLF'YEWMPDSNVn^SIJCADroYQICTIW 348 

HMBRB VNRRBNTNLFYEWTPESDRLSMVKADVDYQKTI^ 349 

HMBRC VTTORRNTNLFYEOTPESDRLSMVTCADVDYQKTKVSAVNYTCGSPPIEDS^ 350 

HMBRMS 11 VNRRRNANLFY^OTPDSNWI*SSIJCADroYQTTICVAAVNNKGS 349 
************ *-*..** •••• •*•.•*•.♦.• **•*•. .* 

HMBRA WETEYHKKEVGEIYNRSMDTRFJGIFTIJII^SHPL^ 398 

HMBRB V^EYHKKEVGEIYNRSMDTTFKRITUIMDSW 399 

HMBRC LTRNYNQKDIXEIYNRSMDTRFKRITIJU^SHPI^LGTC 400 

HMBRMS 11 VrTRNYNQKDLENIYNRSMDTRFKJOTIJUm 398 

HMBRA RRDFENLNRDDYYFSGR WRTTSS I QHPVKTTNYGFSLSDQIQWNDVFSS 448 

HMBRB QRDFEWUmDDYYFSGRVVRTTNSIQHPNnCTTNYGFSLSDQIQ 449 

HMBRC RRDFENLNRDDYYFSGR\A^TTSSIQHPVKTTNYGFSLSDQIQWND 450 

HMBRMS 11 RREFENLNRDDYYFSERVSRTTSSIQHPVKTTNYGFSLSDQIQWNDW 448 



HMBRA 
HMBRB 
HMBRC 



RAG IR YDHTKKTPQELNAECHACDKTP PAANTYKGWSGPVGLAAOLNQAW 498 
RAGIRYDHTKMTPQELNADCHACDKTP PAANTYKGWSGFVGLAAQLSQTW 499 
RAG I R YDHTKMT PQELNAECHACDKT P PAANTYKGWSGFVGLAAQLNQAff S00 
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HMBRA RVGYDI TSG YRVPNAS EVYFTYNHGSGNWL PNPNLKAERSTTHTLS LQGR 

HMBRB RLGYDVTSGFRVPNASEVYFTYNHGSGTWKPNPNLKAERSTTHTLSI^GR 
HMBRC R VG YD I TSG YRVPNAS EVYFTYNHGSGNWL PNPNLKAERTTTHTLSLQGR 

HMBRMS1 1 HVG YD I TSG YRVPNAS EVYFTYNHGSGNWL PNPNLKAERSTTHTLS LQGR 



HMBRA 
HMBRB 
HMBRC 
HMBRMS1 1 



S EKGMLDANL YQSNYRNFLSEEQKLTTSGTPGCTEENAYYS ICSDPYKEK 
GDKGTLDANLYQSNYRNFLSEEQNLTVSGTPGCTEEDAYYYRCSDPYKEK 
SEKGTLDANLYQSNYRNFLSEEQKLTTSGDVSCTQMNYYYGMCSNPYSEK 
SEKGTLDANLYQNNYRNFLSEEQKLTTSGDVGCTQMNYYYGMCSNPYSEK 



HMBRA 
HMBRB 
HMBRC 
HMBRMS11 



-DWQMKN I DKAR I RG I ELTGRLNVDKV AS FVPEGWKLFGSLG YAKS KLSG 
LDWQMKN I DKAR I RG I ELTGRLNVDKVAS FVPEGWKLFG S LGYAKS KLSG 
LEWQMQN I DKARI RG I ELTGRLNVDKV AS FVP EGWKLFGS LGYAKS KLSG 
PEWQMQNZDKARIRGLELTGRLNVTKVASFVPEGWKLFGSLGYAKSKLSG 



HMBRA 
HMBRB 
HMBRC 
HMBRMS1I 



DNSLLSTQPLKVIAGIDYESPSEKWGVFSRLTYLGAKKVKDAQYTVYENK 
DNSLLSTQPLKVIAGIDYESPSEKWGWSRLTYLGAKKVKDAQYTVYENK 
DNSLLSTQPLKVIAGIDYESPSEKWGVFSRLTYLGAKXVKDAQYTVYENK 
DNSLLSTQPPKVIAGVDYESPSEKWGVFSRLTYLGAKKAKDAQYTVYENK 



HMBRA 
HMBRB 
HMBRC 
HMBRMS1 1 



GWGTPLQKKVKDYPWUWSAYVFDMYGFTKPVKNLTLRAGVYNL^ 
GWGT P LQKKVKD Y PWLNK S A YVFDMYGFYKP AKNLTLRAGVYNLFNRKYT 
GWGTPLQKKVKDYPWIJWSAYVFDMYGFYKPVKNLTLRAGVYNVFNRKYT 
GRGTPLQKKVKDYPWI^SAYVFDMYGFYKIJUCNLTU^^ 



HMBRA 
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HMBRC 
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TWDS LRGLYSYSTTNAVDRDGKGLDRYRAPGRNYAVSLEWKF 790 
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TWDSLRGLYSYSTTNAVDRDGKGLDRYRASGRNYAVSLDWKF 790 
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(57) Abstract 



The present invention relates to novel bacterial hemoglobin receptor proteins and genes that encode such proteins. The invention 
is directed toward the isolation, characterization, diagnostic and therapeutic use of bacterial hemoglobin receptor proteins, nucleic acids 
encoding such proteins, recombinant expression constructs comprising such nucleic acids and cells transformed therewith, and antibodies 
and epitopes of such hemoglobin receptor proteins.The invention relates particularly to hemoglobin receptor proteins and genes encoding 
such proteins from Neisseria species, especially N. meningitidis and serotypes thereof, and N. gonorrhoeae. Methods for the diagnostic and 
therapeutic use of the proteins, epitopes, antibodies and nucleic acids of the invention are also provided, including the use of the proteins, 
epitopes, antibodies and nucleic acids of the invention for the production of vaccines effective in providing immunization of a human against 
infection by pathogenic bacteria of Neisseria species. 
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(57) Abstract 



The present invention relates to novel bacterial hemoglobin receptor proteins and genes that encode such proteins. The invention 
is directed toward the isolation, characterization, diagnostic and therapeutic use of bacterial hemoglobin receptor proteins, nucleic acids 
encoding such proteins, recombinant expression constructs comprising such nucleic acids and cells transformed therewith, and antibodies 
and epitopes of such hemoglobin receptor proteins.The invention relates particularly to hemoglobin receptor proteins and genes encoding 
such proteins from Neisseria species, especially N. meningitidis and serotypes thereof, and N. gonorrhoeae. Methods for the diagnostic and 
therapeutic use of the proteins, epitopes, antibodies and nucleic acids of the invention are also provided, including the use of the proteins, 
epitopes, antibodies and nucleic acids of the invention for the production of vaccines effective in providing immunization of a human against 
infection by pathogenic bacteria of Neisseria species. 
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HEMOGLOBIN RECEPTORS FROM NEISSERIAS 

This invention was made with government support under National Institute 
of Health grants R01 AI32493 and R01 AI22933. The U.S. government has certain 
rights to this invention. 



1. Field of the Invention 

This invention relates to hemoglobin receptor genes and the proteins encoded 
therefrom of certain bacterial species, particularly species of Neisseria bacteria. 
More particularly, this invention relates to hemoglobin receptor genes, polypeptides 
and peptides useful for preparing vaccines and antibodies against Neisseria, and 
methods and means for producing such peptides and polypeptides in vitro. Also 
provided are diagnostic and therapeutic methods and reagents useful in detecting and 
treating Neisseria infection and methods for developing novel and effective anti- 
Neisseria agents. 

2. Background of the Invention 

The Neisseriae comprise a genus of bacteria that includes two gram-negative 
species of pyogenic cocci pathogenic for humans: Neisseria meningitidis and 
Neisseria gonorrhoeae. N. meningitidis is a major cause of bacterial meningitis in 
humans, especially children. The disease characteristically proceeds from 
asymptomatic carriage of the bacterium in the nasopharynx to invasion of the 
bloodstream and cerebrospinal fluid in susceptible individuals. 

Neisseria meningitidis is one of the leading causes of bacterial meningitis in 
children and healthy adults in the world. The severity of the disease is evidenced 
by the ability of meningococci to cause the death of previously healthy individuals 
in less than 24 hours. N. meningitidis has a polysaccharide capsule whose diversity 
of component antigenic polysaccharide molecules has resulted in the classification of 
ten different sero groups. Of these, group A strains are the classic epidemic strains; 
group B and C are generally endemic strains, but C occasionally causes an epidemic 
outbreak. All known group A strains have the same protein antigens on their outer 
membranes, while group B strains have a dozen serotypes or groupings based on the 
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presence of principal outer membrane protein antigens (as opposed to 
polysaccharides) . 

Survival of a pathogen such as N. meningitidis in a host depends on its ability 
to overcome a battery of host defense mechanisms. One nonspecific host defense 
5 mechanism against microbial intruders is to limit the availability of iron in tissues 

(Weinberg, 1984, Physiological. Rev. 64i 65-102), because iron is a necessary 
nutrient for most microbial pathogens. The vast majority of iron in the human adult 
is located intracellularly in the form of hemoglobin (76%) or ferritin (23%). The 
remainder can be found extracellularly bound to host iron-binding proteins such as 

10 transferrin and lactoferrin (Otto et al., 1992, Crit. Rev. Microbiol. 18: 217-233). 

Pathogenic bacteria have adapted to this iron-limiting environment by 
developing highly specific and effective iron assimilation systems. A large number 
of these bacteria secrete siderophores, small, non-protein iron chelators which, due 
to their extremely high affinity for iron (III), scavenge trace amounts of iron(III) 

15 from the environment and shuttle the iron back to the bacterial cell (Baggs and 

Neilands, 1987, Microbiol. Rev. 51: 509-518; Braun and Hantke, 1991, in 
Winkelmann (ed.), Handbook of Microbial Iron Chelates, CRC Press: Boca Raton, 
Fla., pp. 107-138.). 

Alternatively, some bacterial pathogens, like Neisseriae species (Archilbald 

20 and DeVoe, 1979, FEMS Microbiol. Lett. 6: 159-162; Mickelson et al. , 1982, Infect. 

Immun. 35: 915-920; Dyer et al. 9 1987, Infect. Immun. 55: 2171-2175), 
Haemophilus influenzae (Coulton and Pang, 1983, Curr. Microbiol. 9: 93-98; 
Schryvers, 1988, MoL Microbiol 2: 467-472; Jarosik et al, 1994, Infect. Immun. 
62: 2470-2477), Vibrio cholerae (Stoebner and Payne, 1988, Infect. Immun. 56: 

25 2891-2895; Henderson and Payne, 1994, 7. BacterioL 176: 3269-3277), Yersiniae 

(Stojiljkovic and Hantke, 1992, EMBO J. U: 4359-4367) and Actinobacillus 
pleuropneumoniae (Gerlach et al., 1992, Infect, Immun. 60: 3253-3261) have 
evolved more sophisticated mechanisms to sequester iron from the host. These 
pathogens can directly bind host's iron-binding proteins such as lactoferrin, 

30 transferrin, and heme-containing compounds, and use them as sole sources of iron. 

The importance of iron in the virulence of N. meningitidis was demonstrated 
by in vivo studies using mice as the animal model system (Calver et al. , 1976, Can. 
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/. Microbiol 22: 832-838; Holbien et al, 1981, Infect. Immun. 34: 120-125). 
Specific iron-regulated outer membrane receptors have been shown to be involved 
in the binding and the utilization of lactoferrin- and transferrin-iron in Neisseriae 
(Schiyvers and Morris, 1988, Infect. Immun. 56: 1144-1149 and Mol. Microbiol. 2: 
5 281-288; Legrain et al., 1993, Gene 130: 81-90; Pettersson et al. 9 1993, Infect. 

Immun. 61: 4724-4733 and 1994, /. Bacteriol. 176: 1764-1766). These receptors 
share significant amino acid similarity and, most probably, also the mechanism of 
iron internalization, with receptors for siderophores and vitamin B12 of other Gram- 
negative bacteria (Cornelissen et al, 1993, J. Bacteriol. 114: 5788-5797). In 
10 contrast, the mechanism by which Neisseriae utilize hemoglobin- and hemin-iron as 

well as the components involved have so far not been described. 

Recently, several proteins with hemoglobin-binding and/or hemin-binding 
activities have been identified in total membranes of iron-limited N. meningitidis and 
N. gonorrhoeae. 

15 Lee and Hill, 1992, /. gen. Microbiol. 13§: 2647-2656 disclose the specific 

hemoglobin binding by isolated outer membranes of N. meningitidis. 

Martek and Lee, 1994, Infect. Immun. 62: 700-703 disclosed that acquisition 
of heme iron by N. meningitidis does not involve meningococcal transferrin-binding 
proteins. 

20 Lee, 1994, Microbiol. 140: 1473-1480 describes the biochemical isolation and 

characterization of hemin binding proteins from N. meningitidis. 

The precise role of these proteins in hemin and/or hemoglobin utilization 
remains unclear at present, although these proteins are likely to be components of 
a hemin-utilization system in N. meningitidis. 

25 The dependence on host iron stores for Neisseria growth is a potentially 

useful route towards the development of novel and effective therapeutic intervention 
strategies. Historically, infections of both N. meningitidis and N. gonorrhoeae were 
treated chemoprophylactically with sulfonamide drugs. However, with the 
development of sulfonamide-resistant strains came the necessity of using alternative 

30 modes of therapy such as antibiotic treatment. More recently, the drug treatment of 

choice includes the administration of high grade penicillin. However, the success 
of antimicrobial treatment is decreased if therapy is not initiated early after infection. 
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Gonococcal infection has also been treated with penicillin, ampicillin, or 
amoxicillin, tetracycline hydrochloride, and spectinomycin. Unfortunately, because 
the incidence of infections due to penicillinase-producing bacteria has increased, 
several new, more expensive fl-lactam antibiotics have been used in treatment. 
5 Despite the fact that existing antibiotics have decreased the serious consequences of 

gonorrhea, their use has not lowered the incidence of the infection in the general 
population. 

Prevention of meningococcal disease has been attempted by chemoprophylaxis 
and immuno prophylaxis. At present, rifampin and minocycline are used, but only 

10 for humans in close contact with an infected person as this treatment has a number 

of disadvantages. The only commercially available vaccine against meningococcal 
meningitis has as its major component the bacterial polysaccharide capsule. In adults 
this vaccine protects against serogroups A, C, Y and W135. It is not effective 
against serogroup B, and is ineffective in children against serogroup C. Thus far, 

15 immunoprophylatic preventive treatment has not been available for N. gonorrhoeae. 

Thus, what is needed are better preventative therapies for meningococcal 
meningitis and gonorrhea including more effective, longer lasting vaccines which 
protect across all of the serogroups of N. meningitidis and all the serotypes of N. 
gonorrhoeae. In addition, better methods are need to treat meningococcal and 

20 gonococcal infection. 

SUMMARY OF THE INVENTION 

The present invention relates to the cloning, expression and functional 
characterization of genes encoding bacterial hemoglobin receptor proteins. 

25 Specifically, the invention relates to genes encoding hemoglobin receptor proteins 

from Neisseria species, in particular Neisseria meningitidis and N. gonorrhoeae. The 
invention comprises species of nucleic acids having a nucleotide sequence encoding 
novel bacterial hemoglobin receptor proteins. Also provided by this invention is the 
deduced amino acid sequence of the cognate hemoglobin receptor proteins of these 

30 bacterial genes. 

The invention provides nucleic acids, nucleic acid hybridization probes, 
recombinant expression constructs capable of expressing the hemoglobin receptor 
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protein of the invention in cultures of transformed cells, preferably bacterial cells, 
and such cultures of transformed bacterial cells that express the hemoglobin receptor 
proteins of the invention. The invention also provides gene knockout vectors for 
inactivating the hemoglobin receptor protein gene in cells, particularly cells of 
5 Neisseria species, via, for example, homologous recombination and other 
mechanisms, and cultures of such hemoglobin receptor protein null mutant cells. 

The invention also provides homogeneous preparations of the bacterial 
hemoglobin receptor proteins of the invention, as well as antibodies against and 
epitopes of the hemoglobin receptor protein. Methods for characterizing this 

10 receptor protein and methods for using the protein in the development of agents 
having pharmacological uses related to this receptor, particularly bactericidal and 
bacteriostatic uses, are also provided by the invention. 

In other embodiments of this invention are provided diagnostic methods and 
reagents encompassing the use of the anti-Neisseria hemoglobin receptor protein 

15 antibodies of the invention. Still further embodiments provided herein include 

therapeutic methods and reagents encompassing the use of the anti-Neisseria 
hemoglobin receptor protein antibodies of the invention. Even more embodiments 
include diagnostic methods and reagents encompassing the use of the Neisseria 
hemoglobin receptor protein-encoding nucleic acids of the invention, as sensitive 

20 probes for the presence of Neisseria infection using nucleic acid hybridization 
techniques and/or in vitro amplification methodologies. Yet additional embodiments 
of the invention include therapeutic methods and reagents encompassing the use of 
the Neisseria hemoglobin receptor protein-encoding nucleic acids of the invention, 
comprising recombinant expression constructs engineered to produce antisense 

25 transcripts of the Neisseria hemoglobin receptor gene and fragments thereof, as well 

as recombinant knockout vectors of the invention. The invention also provides the 
Neisseria hemoglobin receptor protein and epitopes thereof as components of 
vaccines for the development of non-disease associated immunity to pathological 
infection with bacteria of Neisseria species. 

30 In a first aspect, the invention provides a nucleic acid having a nucleotide 

sequence encoding a bacterial hemoglobin receptor protein gene. In a preferred 
embodiment, the bacterial hemoglobin receptor protein gene is isolated from bacteria 
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of Neisseria species. In a particularly preferred embodiment, the hemoglobin 
receptor protein gene is isolated from Neisseria meningitidis, serotype C. In a 
particular example of this embodiment, the nucleic acid comprises a 3.3 kilobase (kb) 
BamHL/Hindlll fragment of N. meningitidis genomic DNA. In this embodiment, the 
5 nucleotide sequence comprises an open reading frame of 2376 nucleotides of N. 

meningitidis genomic DNA encoding 792 amino acids comprising the hemoglobin 
receptor gene. In this embodiment of the invention, the nucleotide sequence of the 
N. meningitidis hemoglobin receptor gene is the sequence depicted in Figure 2 (SEQ 
ID No: 1). It will be understood that the N. meningitidis gene as disclosed herein is 

10 defined, insofar as is necessary, by the amino acid sequence of the protein encoded 
therein, said amino acid sequence being represented in Figure 2 (SEQ. ID No.: 2). 
Thus, it will be understood that the particular nucleotide sequence depicted in Figure 
2 (SEQ. ID. No.:l) is but one of a number of equivalent nucleotide sequences that 
encode the hemoglobin receptor protein, due to the degeneracy of the genetic code, 

15 and that all such alternative, equivalent nucleotide sequences are hereby explicitly 

encompassed within the disclosed nucleotide sequences of the invention. Also 
included herein are any mutant or allelic variations of this nucleotide sequence, either 
naturally occurring or the product of in vitro chemical or genetic modification. Each 
such variant will be understood to have essentially the same nucleotide sequence as 

20 the nucleotide sequence of the corresponding N. meningitidis hemoglobin receptor 

protein disclosed herein. 

In another particularly preferred embodiment of this aspect of the invention, 
the hemoglobin receptor protein gene is isolated from Neisseria meningitidis, 
serotype A. In a particular example of this embodiment, the nucleic acid comprises 

25 a 2373 basepair (bp) polymerase chain reaction-amplified fragment of N. 

meningitidis, serotype A genomic DNA. In this embodiment, the nucleotide 
sequence comprises an open reading frame of 2373 nucleotides of N. meningitidis 
genomic DNA encoding 790 amino acids comprising the hemoglobin receptor gene. 
In this embodiment of the invention, the nucleotide sequence of the N. meningitidis 

30 hemoglobin receptor gene is the sequence depicted in Figure 7 (SEQ ID No: 3). It 

will be understood that the N. meningitidis gene as disclosed herein is defined, 
insofar as is necessary, by the amino acid sequence of the protein encoded therein, 
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said amino acid sequence being represented in Figure 7 (SEQ. ID No.: 4). Thus, it 
will be understood that the particular nucleotide sequence depicted in Figure 7 (SEQ. 
ID. No. :3) is but one of a number of equivalent nucleotide sequences that encode the 
hemoglobin receptor protein, due to the degeneracy of the genetic code, and that all 
5 such alternative, equivalent nucleotide sequences are hereby explicitly encompassed 

within the disclosed nucleotide sequences of the invention. Also included herein are 
any mutant or allelic variations of this nucleotide sequence, either naturally occurring 
or the product of in vitro chemical or genetic modification. Each such variant will 
be understood to have essentially the same nucleotide sequence as the nucleotide 
10 sequence of the corresponding N. meningitidis hemoglobin receptor protein disclosed 

herein. 

In another particularly preferred embodiment of this aspect of the invention, 
the hemoglobin receptor protein gene is isolated from Neisseria meningitidis, 
serotype B. In a particular example of this embodiment, the nucleic acid comprises 

15 a 2376 basepair (bp) polymerase chain reaction-amplified fragment of N. 

meningitidis, serotype A genomic DNA. In this embodiment, the nucleotide 
sequence comprises an open reading frame of 2373 nucleotides of N. meningitidis 
genomic DNA encoding 791 amino acids comprising the hemoglobin receptor gene. 
In this embodiment of the invention, the nucleotide sequence of the N. meningitidis 

20 hemoglobin receptor gene is the sequence depicted in Figure 8 (SEQ ID No: 5). It 

will be understood that the N. meningitidis gene as disclosed herein is defined, 
insofar as is necessary, by the amino acid sequence of the protein encoded therein, 
said amino acid sequence being represented in Figure 8 (SEQ. ID No.: 6). Thus, it 
will be understood that the particular nucleotide sequence depicted in Figure 8 (SEQ. 

25 ID. No. :S) is but one of a number of equivalent nucleotide sequences that encode the 

hemoglobin receptor protein, due to the degeneracy of the genetic code, and that all 
such alternative, equivalent nucleotide sequences are hereby explicitly encompassed 
within the disclosed nucleotide sequences of the invention. Also included herein are 
any mutant or allelic variations of this nucleotide sequence, either naturally occurring 

30 or the product of in vitro chemical or genetic modification. Each such variant will 

be understood to have essentially the same nucleotide sequence as the nucleotide 
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sequence of the corresponding N. meningitidis hemoglobin receptor protein disclosed 
herein. 

In yet other preferred embodiments, the invention provides nucleic acid 
encoding a hemoglobin receptor protein gene isolated from Neisseria gonorrhoeae: 
In a particular example of this embodiment, the nucleic acid comprises a 2378 
basepair (bp) polymerase chain reaction-amplified fragment of N. gonorrhoeae 
genomic DNA. In this embodiment, the nucleotide sequence comprises an open 
reading frame of 2373 nucleotides of N. gonorrhoeae genomic DNA encoding 791 
amino acids comprising the hemoglobin receptor gene. In this embodiment of the 
invention, the nucleotide sequence of the N. gonorrhoeae hemoglobin receptor gene 
is the sequence depicted in Figure 9 (SEQ ID No: 7). It will be understood that the 
AT. gonorrhoeae gene as disclosed herein is defined, insofar as is necessary, by the 
amino acid sequence of the protein encoded therein, said amino acid sequence being 
represented in Figure 9 (SEQ. ID No.: 8). Thus, it will be understood that the 
particular nucleotide sequence depicted in Figure 9 (SEQ. ID. No.: 7) is but one of 
a number of equivalent nucleotide sequences that encode the hemoglobin receptor 
protein, due to the degeneracy of the genetic code, and that all such alternative, 
equivalent nucleotide sequences are hereby explicitly encompassed within the 
disclosed nucleotide sequences of the invention. Also included herein are any mutant 
or allelic variations of this nucleotide sequence, either naturally occurring or the 
product of in vitro chemical or genetic modification. Each such variant will be 
understood to have essentially the same nucleotide sequence as the nucleotide 
sequence of the corresponding N. gonorrhoeae hemoglobin receptor protein disclosed 
herein. 

The invention also provides bacterial hemoglobin receptor proteins. In a 
preferred embodiment, the bacterial hemoglobin receptor protein is isolated from 
bacteria of Neisseria species. In a particularly preferred embodiment, the 
hemoglobin receptor protein is isolated from Neisseria meningitidis. In a particular 
example of this embodiment, the protein is derived from N. meningitidis, serotype 
C and comprises an amino acid sequence of 792 amino acids. In this embodiment 
of the invention, the amino acid sequence of the N. meningitidis, serotype C 
hemoglobin receptor protein is the sequence depicted in Figure 2 (SEQ ID No: 2). 
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In another example of this embodiment, the protein is derived from N. meningitidis, 
serotype A and comprises an amino acid sequence of 790 amino acids. In this 
embodiment of the invention, the amino acid sequence of the N. meningitidis, 
serotype A hemoglobin receptor protein is the sequence depicted in Figure 7 (SEQ 
5 ID No:4). In yet another example of this embodiment, the protein is derived from 

N. meningitidis, serotype B and comprises an amino acid sequence of 791 amino 
acids. In this embodiment of the invention, the amino acid sequence of the N. 
meningitidis, serotype B hemoglobin receptor protein is the sequence depicted in 
Figure 8 (SEQ ID No:6). The invention also provides hemoglobin receptor protein 

10 derived from N. gonorrhoeae. In this embodiment of the invention, the protein 
comprises an amino acid sequence of 791 amino acids, and the amino acid sequence 
of the N. gonorrhoeae hemoglobin receptor protein is the sequence depicted in 
Figure 9 (SEQ ID No: 8). Also explicitly encompassed within the scope of this 
invention are related bacterial hemoglobin receptor proteins, particularly such 

15 proteins isolated from Neisseria species, having essentially the same amino acid 

sequence and substantially the same biological properties as the hemoglobin receptor 
protein encoded by the N. meningitidis and N. gonorrhoeae nucleotide sequences 
described herein. 

In another aspect, the invention provides a homogeneous preparation of an 
20 approximately 85.5 kiloDalton (kD) bacterial hemoglobin receptor protein or 
derivative thereof, said size being understood to be the size of the protein before any 
post-translational modifications thereof. Also provided is a 90kD embodiment of the 
receptor as determined by sodium dodecyl sulfate/ polyacrylamide gel electrophoresis 
under reducing conditions. In a preferred embodiment, the bacterial hemoglobin 
25 receptor protein is isolated from bacteria of Neisseria species. In a particularly 
preferred embodiment, the hemoglobin receptor protein is isolated from Neisseria 
meningitidis. In one embodiment of this aspect of the invention, the protein is 
isolated from N. meningitidis, serotype C and the amino acid sequence of the 
* bacterial hemoglobin receptor protein or derivative thereof preferably is the amino 
30 acid sequence of the hemoglobin receptor protein shown in Figure 2 (SEQ ID No:2). 

In a second embodiment of this aspect of the invention, the protein is isolated from 
N. meningitidis, serotype A and the amino acid sequence of the bacterial hemoglobin 
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receptor protein or derivative thereof preferably is the amino acid sequence of the 
hemoglobin receptor protein shown in Figure 7 (SEQ ID No: 4). In a third 
embodiment of this aspect of the invention, the protein is isolated from N. 
meningitidis, serotype B and the amino acid sequence of the bacterial hemoglobin 
5 receptor protein or derivative thereof preferably is the amino acid sequence of the 

hemoglobin receptor protein shown in Figure 8 (SEQ ID No:6). The invention also 
provides a homogeneous preparation of a bacterial hemoglobin receptor protein 
isolated from N. gonorrhoeae. In a preferred embodiment, the amino acid sequence 
of the bacterial hemoglobin receptor protein or derivative thereof preferably is the 
10 amino acid sequence of the hemoglobin receptor protein shown in Figure 9 (SEQ ID 

No: 8). 

This invention provides nucleotide probes derived from the nucleotide 
sequences herein provided. The invention includes probes isolated from either 
complementary DNA (cDNA) copies of bacterial messenger RNA (mRNA) or 

15 bacterial genomic DNA (gDNA), as well as probes made synthetically or by in vitro 

amplification methods using the sequence information provided herein. The 
invention specifically includes but is not limited to oligonucleotide, nick-translated, 
random primed, or in vitro amplified probes made using cDNA or genomic clones 
embodying the invention, and oligonucleotide and other synthetic probes synthesized 

20 chemically using the nucleotide sequence information of cDNA or genomic clone 

embodiments of the invention. 

It is a further object of this invention to provide such nucleic acid 
hybridization probes to detect the presence of bacteria of Neisseria species, 
particularly N. meningitidis and N. gonorrhoeae, in a biological sample in the 

25 diagnosis of a Neisseria infection in a human. Such a biological sample preferably 

includes blood, urine, semen, mucus, cerebrospinal fluid, peritoneal fluid and ascites 
fluids, as well as cell scrapings from the epithelium of the mouth, urethra, anus and 
rectum, and other organs. 

The present invention also includes peptides encoded, by the nucleotide 

30 sequences comprising the nucleic acid embodiments of the invention. The invention 

includes either naturally occurring or synthetic peptides which may be used as 
antigens for the production of hemoglobin receptor protein-specific antibodies. The 

- 10 - 

BNSDOCID: <WO 9612020A3_IA> 



WO 96/12020 




PCT/US95/13623 



invention also comprises such antibodies, preferably monoclonal antibodies, and cells 
and cultures of cells producing such antibodies. 

Thus, the invention also provides antibodies against and epitopes of bacterial 
hemoglobin receptor proteins of the invention. It is an object of the present 
5 invention to provide antibodies that are immunologically reactive to the bacterial 

hemoglobin receptor proteins of the invention; It is a particular object to provide 
monoclonal antibodies against these bacterial hemoglobin receptor proteins. In a 
preferred embodiment, antibodies provided are raised against bacterial hemoglobin 
receptor protein isolated from bacteria of Neisseria species. In a particularly 

10 preferred embodiment, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria meningitidis serotypes A, B or C. In additional 
particularly preferred embodiment, such antibodies are specific for the hemoglobin 
receptor protein isolated from Neisseria gonorrhoeae. 

Hybridoma cell lines producing such antibodies are also objects of the 

15 invention. It is envisioned at such hybridoma cell lines may be produced as the 

result of fusion between a non-immunoglobulin producing mouse myeloma cell line 
and spleen cells derived from a mouse immunized with purified hemoglobin receptor 
protein or a cell expressing antigens or epitopes of bacterial hemoglobin receptor 
proteins of the invention. The present invention also provides hybridoma cell lines 

20 that produce such antibodies, and can be injected into a living mouse to provide an 

ascites fluid from the mouse that is comprised of such antibodies. In a preferred 
embodiment, antibodies provided are raised against bacterial hemoglobin receptor 
protein isolated from bacteria of Neisseria species. In a particularly preferred 
embodiment, such antibodies are specific for the hemoglobin receptor protein isolated 

25 from Neisseria meningitidis, serotypes A, B or C. In additional particularly 

preferred embodiment, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria gonorrhoeae. 

It is a further object of the invention to provide immunologically-active 
epitopes of the bacterial hemoglobin receptor proteins of the invention. Chimeric 

30 antibodies immunologically reactive against the bacterial hemoglobin receptor 
proteins of the invention are also within the scope of this invention. In a preferred 
embodiment, antibodies and epitopes provided are raised against or derived from 
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bacterial hemoglobin receptor protein isolated from bacteria of Neisseria species. 
In a particularly preferred embodiment, such antibodies and epitopes are specific for 
the hemoglobin receptor protein isolated from Neisseria meningitidis, serotypes A, 
B or C. In additional particularly preferred embodiment, such antibodies and 
5 epitopes are specific for the hemoglobin receptor protein isolated from Neisseria 

gonorrhoeae. 

The present invention provides recombinant expression constructs comprising 
a nucleic acid encoding a bacterial hemoglobin receptor protein wherein the construct 
is capable of expressing the encoded hemoglobin receptor protein in cultures of cells 

10 transformed with the construct. Preferred embodiments of such constructs comprise 
the N. meningitidis, serotype C hemoglobin receptor gene depicted in Figure 2 (SEQ 
ID No.:l), such constructs being capable of expressing the bacterial hemoglobin 
receptor protein encoded therein in cells transformed with the construct. Additional 
preferred embodiments of such constructs comprise the N. meningitidis, serotype A 

15 hemoglobin receptor gene depicted in Figure 7 (SEQ ID No.: 3), such constructs 

being capable of expressing the bacterial hemoglobin receptor protein encoded 
therein in cells transformed with the construct. Further additional preferred 
embodiments of such constructs comprise the N. meningitidis, serotype B hemoglobin 
receptor gene depicted in Figure 8 (SEQ ID No.:5), such constructs being capable 

20 of expressing the bacterial hemoglobin receptor protein encoded therein in cells 
transformed with the construct. The invention also provides recombinant expression 
constructs encoding a hemoglobin receptor protein gene isolted from ZN. 
gonorrhoeae. In a particularly preferred embodiment, such constructs comprise the 
N. gonorrhoeae hemoglobin receptor gene depicted in Figure 9 (SEQ ID No. :7), the 

25 constructs being capable of expressing the bacterial hemoglobin receptor protein 

encoded therein in cells transformed with the construct. 

The invention also provides cultures of cells, preferably bacterial cells, having 
been transformed with the recombinant expression constructs of the invention, each 
such cultures being capable of and in fact expressing the bacterial hemoglobin 

30 receptor protein encoded in the transforming construct. 

The present invention also includes within its scope protein preparations of 
prokaryotic cell membranes containing the bacterial hemoglobin receptor protein of 
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the invention, derived from cultures of prokaryotic cells transformed with the 
recombinant expression constructs of the invention. 

The invention also provides diagnostic reagents and methods for using such 
reagents for detecting the existence of an infection in a human, with bacteria of a 
5 Neisseria species. In preferred embodiments, such diagnostic reagents comprise 

antibodies that are immunologically reactive with a bacterial hemoglobin receptor 
protein. In a preferred embodiment, such antibodies are raised against a bacterial 
hemoglobin receptor protein isolated from bacteria of Neisseria species. In a 
particularly preferred embodiment, such antibodies are specific for the hemoglobin 

10 receptor protein isolated from Neisseria meningitidis, serotypes A, B or C. In h 

additional particularly preferred embodiments, such antibodies are specific for the 
hemoglobin receptor protein isolated from Neisseria gonorrhoeae. 

In yet another embodiment of this aspect of the invention are provided 
diagnostic reagents and methods for using such reagents wherein said reagents are j 

IS nucleic acid hybridization probes comprising a bacterial hemoglobin receptor gene. : 

In a preferred embodiment, the bacterial hemoglobin receptor protein gene is isolated 
from bacteria of Neisseria species. In a particularly preferred embodiment, the * 
hemoglobin receptor protein gene is isolated from Neisseria meningitidis. In ^ 
particular examples of this embodiment of the invention, the nucleic acid probes 

20 comprise a specifically-hybridizing fragment of a 3.3 kilobase (kb) BamHUHindBl ; 

fragment of N. meningitidis, serotype C genomic DNA. In this embodiment, the x 
nucleotide sequence comprises all or a specifically-hybridizing fragment of an open 
reading frame of 2376 nucleotides of N. meningitidis, serotype C genomic DNA 
encoding 792 amino acids comprising the hemoglobin receptor gene. In this 

25 embodiment of the invention, the nucleotide sequence of the N. meningitidis, 

serotype C hemoglobin receptor gene is the sequence depicted in Figure 2 (SEQ ID 
No:l). In another example of this embodiment of the invention, the nucleic acid 
probes comprise a specifically-hybridizing fragment of a 2373bp, polymerase chain 
reaction-amplified fragment of N. meningitidis, serotype A genomic DNA. In this 

30 embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 
fragment of an open reading frame of 2370 nucleotides of N. meningitidis, serotype 
A genomic DNA encoding 790 amino acids comprising the hemoglobin receptor 
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gene. In this embodiment of the invention, the nucleotide sequence of the N. 
meningitidis, serotype A hemoglobin receptor gene is the sequence depicted in Figure 
7 (SEQ ID No:3). In yet another example of this embodiment of the invention, the 
nucleic acid probes comprise a specifically-hybridizing fragment of a 2376bp, 
5 polymerase chain reaction-amplified fragment of N. meningitidis, serotype B genomic 

DNA. In this embodiment, the nucleotide sequence comprises all or a specifically- 
hybridizing fragment of an open reading frame of 2373 nucleotides of N. 
meningitidis , serotype B genomic DNA encoding 791 amino acids comprising the 
hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 

10 sequence of the N. meningitidis, serotype B hemoglobin receptor gene is the 

sequence depicted in Figure 8 (SEQ ID No: 5). The invention also provides nucleic 
acid hybridization probes comprising a bacterial hemoglobin receptor gene isolated 
from N. gonorrhoeae. In a preferred embodiment of this aspect of the invention, the 
nucleic acid probes comprise a specifically-hybridizing fragment of a 2378bp, 

15 polymerase chain reaction-amplified fragment of N. gonorrhoeae genomic DNA. In 

this embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 
fragment of an open reading frame of 2373 nucleotides of N. gonorrhoeae genomic 
DNA encoding 791 amino acids comprising the hemoglobin receptor gene. In this 
embodiment of the invention, the nucleotide sequence of the N. gonorrhoeae 

20 hemoglobin receptor gene is the sequence depicted in Figure 9 (SEQ ID No: 7). It 

will be understood that the term "specifically-hybridizing" when used to describe a 
fragment of a nucleic acid encoding a bacterial hemoglobin receptor gene is intended 
to mean that nucleic acid hybridization of such a fragment is stable under high 
stringency conditions of hybridization and washing as the term "high stringency" 

25 would be understood by those having skill in the molecular biological arts. 

Also provided by the invention are therapeutic agents and methods for using 
such agents for treating the an infection in a human, with bacteria of a Neisseria 
species. In preferred embodiments, such agents comprise antibodies that are 
immunologically reactive with a bacterial hemoglobin receptor protein. In a 

30 preferred embodiment, such antibodies are raised against a bacterial hemoglobin 

receptor protein isolated from bacteria of Neisseria species. In a particularly 
preferred embodiment, such antibodies are specific for the hemoglobin receptor 
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protein isolated from Neisseria meningitidis, serotypes A, B or C. In additional 
preferred embodiments, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria gonorrhoeae. Therapeutic agents provided in this 
aspect of the invention comprise such antibodies in a pharmaceutically-acceptable 
5 carrier, along with appropriate adjuvants and the like. In additional embodiments, 

such antibodies are covalently conjugated to a bactericidal or bacteriostatic agent 
effective against bacteria of Neisseria species, preferably N. meningitidis and N. 
gonorrhoeae. 

In yet another embodiment of this aspect of the invention are provided 

10 therapeutic reagents and methods for using such reagents wherein said reagents 
comprise recombinant expression constructs of the invention, or a homologue thereof 
that expresses the nucleic acid encoding a hemoglobin receptor in an antisense 
orientation. In a preferred embodiment, the bacterial hemoglobin receptor protein 
gene is isolated from bacteria of Neisseria species. In a particularly preferred 

15 embodiment, the hemoglobin receptor protein gene is isolated from Neisseria 
meningitidis. In particular examples of this embodiment of the invention, the nucleic 
acids comprise a specifically-hybridizing fragment of a 3.3 kilobase (kb) 
BamlU/Hindni fragment of N. meningitidis, serotype C genomic DNA. In this 
embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 

20 fragment of an open reading frame of 2376 nucleotides of N meningitidis, serotype 
C genomic DNA encoding 792 amino acids comprising the hemoglobin receptor 
gene. In this embodiment of the invention, the nucleotide sequence of the N. 
meningitidis, serotype C hemoglobin receptor gene is the sequence depicted in Figure 
2 (SEQ ID No:l). In another example of this embodiment of the invention, the 

25 nucleic acid probes comprise a specifically-hybridizing fragment of a 2373bp, 
polymerase chain reaction-amplified fragment of N. meningitidis, serotype A 
genomic DNA. In this embodiment, the nucleotide sequence comprises all or a 
specifically-hybridizing fragment of an open reading frame of 2370 nucleotides of 
N meningitidis, serotype A genomic DNA encoding 790 amino acids comprising the 

30 hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the N. meningitidis, serotype A hemoglobin receptor gene is the 
sequence depicted in Figure 7 (SEQ ID No: 3). In yet another example of this 
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embodiment of the invention, the nucleic acid probes comprise a specifically- 
hybridizing fragment of a 2376bp, polymerase chain reaction-amplified fragment of 
TV. meningitidis, serotype B genomic DNA. In this embodiment, the nucleotide 
sequence comprises all or a specifically-hybridizing fragment of an open reading 
frame of 2373 nucleotides of TV. meningitidis, serotype B genomic DNA encoding 
791 amino acids comprising the hemoglobin receptor gene. In this embodiment of 
the invention, the nucleotide sequence of the TV. meningitidis, serotype B hemoglobin 
receptor gene is the sequence depicted in Figure 8 (SEQ ID No:5). The invention 
also provides recombinant expression constructs of the invention, or a homologue 
thereof that expresses the nucleic acid encoding a hemoglobin receptor in an 
antisense orientation, wherein the nucleic acid encodes a bacterial hemoglobin 
receptor gene isolated from TV. gonorrhoeae. In a preferred embodiment of this 
aspect of the invention, the nucleic acid probes comprise a specifically-hybridizing 
fragment of a 2378bp, polymerase chain reaction-amplified fragment of TV. 
gonorrhoeae genomic DNA. In this embodiment, the nucleotide sequence comprises 
all or a specifically-hybridizing fragment of an open reading frame of 2373 
nucleotides of TV. gonorrhoeae genomic DNA encoding 791 amino acids comprising 
the hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the TV. gonorrhoeae hemoglobin receptor gene is the sequence depicted 
in Figure 9 (SEQ ID No:7). 

The invention also provides a method for screening compounds for their 
ability to inhibit, facilitate or modulate the biochemical activity of a bacterial 
hemoglobin receptor protein of the invention, for use in the in vitro screening of 
novel agonist and antagonist compounds and novel bactericidal and bacteriostatic 
agents specific for the hemoglobin receptor protein. In preferred embodiments, cells 
transformed with a recombinant expression construct of the invention are contacted 
with such a compound, and the binding capacity of the compounds, as well as the 
effect of the compound on binding of other, known hemoglobin receptor agonists 
such as hemoglobin and hem in, and antagonists, is assayed. Additional preferred 
embodiments comprise quantitative analyses of such effects. 

The present invention is also useful for the detection of bactericidal and/or 
bacteriostatic analogues, agonists or antagonists, known or unknown, of a bacterial 
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hemoglobin receptor protein, preferably derived from bacteria of Neisseria species, 
most preferably isolated from N meningitidis, wherein such compounds are either 
naturally occurring or embodied as a drug. 

The invention also provides vaccines for immunizing a human against 
5 infection with pathogenic bacteria of Neisseria species, the vaccines comprising the 

hemoglobin binding proteins of the invention or antigenic fragments thereof. In a 
preferred embodiment, the vaccines of the invention comprise cells expressing a 
hemoglobin receptor binding protein of the invention, or an antigenic fragment 
thereof, preferably wherein said cells are attenuated varieties of cells adapted for 

10 growth in humans, i.e., wherein such cells are non-pathogenic and do not cause 
bactermia, endotoxemia or sepsis. Examples of such attenuated varieties of cells 
include attenuated strains of Salmonella species, for example Salmonella typhi and 
Salmonella typhimurium, as well as other attenuated bacterial species. Also provided 
by the invention are recombinant expression constructs as disclosed herein useful per 

15 se as vaccines, for introduction into an animal and production of an immunologic 
response to bacterial hemoglobin receptor protein antigens encoded therein. 

Specific preferred embodiments of the present invention will become evident 
from the following more detailed description of certain preferred embodiments and 
the claims. 

20 

DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects of the present invention, the various features 
thereof, as well as the invention itself may be more fully understood from the 
following description, when read together with the accompanying drawings in which: 
25 Figure 1 is a schematic drawing of the restriction enzyme digestion map of 

a N. meningitidis cosmid clone and subclones thereof derived as described in 
Example 2. 

Figure 2 illustrates the nucleotide (SEQ ID No.:l) and deduced amino acid 
(SEQ ID No.:2) sequences of the N. meningitidis hemoglobin receptor protein 
30 encoded in a 3.3 kb BamHl/Hindni DNA fragment. 

Figure 3 presents a photograph of a stained SDS/ 10% PAGE electrophoresis 
gel showing the results of in vitro expression of the N. meningitidis hemoglobin 
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receptor gene product as an approximately 90 kilodalton protein, and ^-lactamase 
protein having a molecular weight of about 30.0 kilodaltons used as a molecular 
weight marker. 

Figure 4 presents an amino acid sequence comparison between portions of the 
5 N. meningitidis transferrin receptor Tbpl (SEQ ID No.:9), the N. meningitidis 

lactoferrin receptor LbpA (SEQ ID No.: 10), and N. meningitidis hemoglobin 
receptor HmbR (SEQ ID No. :2). 

Figure 5 illustrates Southern hybridization analysis of chromosomal DNA 
from AT. meningitidis 8013 and the MC8013/imW? mutant using a BamHl-Sall 
10 fragment of the hmb gene as probe labeled using a DIG nonradioactive DNA 

labelling and detection kit (Boehringer Mannheim Biochemicals, Indianapolis, IN). 
Lane 1 contains DNA from TV. meningitidis strain MC8013, digested with Clal; lane 
2 is MCS031hmbR DNA digested with Clal; lane 3, is MC8013 DNA digested with 
BamUl and San; and lane 4 is MCSOlShmbR DNA digested with BamHl and Sail. 
15 Figure 6 is a graph describing the course of infection using N. meningitidis 

wild type (MC8013) and hmbR mutant strains in an in vivo rat infant infection 
model. Each strain was injected intraperitoneally (2 x 10 6 CFU) into three infant 
inbred Lewis rats. The results represent the average of two similarly-performed 
experiments. 

20 Figure 7 illustrates the nucleotide (SEQ ID No.: 3) and deduced amino acid 

(SEQ ID No.: 4) sequences of the N. meningitidis, serotype A hemoglobin receptor 
protein encoded on a 2373bp polymerase chain reaction-amplified DNA fragment. 

Figure 8 illustrates the nucleotide (SEQ ID No.:5) and deduced amino acid 
(SEQ ID No.: 6) sequences of the N. meningitidis, serotype B hemoglobin receptor 
25 protein encoded on a 2376bp polymerase chain reaction-amplified DNA fragment. 

Figure 9 illustrates the nucleotide (SEQ ID No.:7) and deduced amino acid 
(SEQ ID No. :8) sequences of the N. gonorrhoeae hemoglobin receptor protein 
encoded on a 2376bp polymerase chain reaction-amplified DNA fragment. 

Figure 10 represents a schematic of a nucleic acid sequence comparison 
30 between the hemoglobin receptor proteins derived from N. meningitidis, serotypes 

A (SEQ ID No.:3), B (SEQ ID No.:5) and C (SEQ ID No.:l) and from N. 
gonorrhoeae (SEQ ID No.: 7), wherein the direction of trascription of the genes is 
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in the direction of the arrow, and the following abbreviations refer to restriction 
endonuclease sites: H represents HindUl; N represents Noil; Bg represents BgH\ Bs 
represents BssHI; Nr represents Nrul; CI represents Clal; P represents Pstl; Sa 
represents Sad; Av represents Aval; B represents BamUl; S represents Sail; EV 
5 represents EcoKV; Sh represents Sphl; and Sy represents Styl. 

Figure 11 presents an amino acid sequence comparison between the 
hemoglobin receptor proteins derived from N. meningitidis, serotypes A (SEQ ID 
No.:4), B (SEQ ID No.:6) and C (SEQ ID No.:2) and from N. gonorrhoeae (SEQ 
ID No.:8). 

10 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The term "bacterial hemoglobin receptor* as used herein refers to bacterial 
proteins comprising the outer membrane of Gram negative bacteria, which 
specifically mediate transit of hemoglobin-derived hemin, as well as hemin from 

15 other sources, through the outer membrane of such bacteria and into the periplasmic 

space. The bacterial hemoglobin receptor proteins of the invention are characterized 
by, first, an amino acid sequence that is essentially the sequence depicted in Figures 
2 (SEQ ID No.:2), 7 (SEQ ID No.:4), 8 (SEQ ID No.:6) and 9 (SEQ ID No.:8). 
The bacterial hemoglobin receptor proteins of the invention are further characterized 

20 by having substantially the same biological activity as a protein having the amino 

acid sequence depicted in Figures 2 (SEQ ID No.:2), 7 (SEQ ID No.:4), 8 (SEQ ID 
No.: 6) and 9 (SEQ ID No.: 8). This definition is intended to encompass naturally- 
occurring variants and mutant proteins, as well as genetically engineered variants 
made by man. 

25 Cloned, isolated and purified nucleic acid provided by the present invention 

may encode a bacterial hemoglobin receptor protein of any Neisseria species of 
origin, including, most preferably, Neisseria meningitidis species and serotypes 
thereof and Neisseria gonorhoeae species. 

The nucleic acid hybridization probes provided by the invention comprise 

30 DNA or RNA having all or a specifically-hybridizing fragment of the nucleotide 

sequence of the hemoglobin receptor protein as depicted in Figures 2 (SEQ ID 
No.:l), 7 (SEQ ID No.:3), 8 (SEQ ID No.:5) and 9 (SEQ ID No.:7), or any portion 
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thereof effective in nucleic acid hybridization. Mixtures of such nucleic acid 
hybridization probes are also within the scope of this embodiment of the invention. 
Nucleic acid probes as provided herein are useful for detecting the presence of a 
bacteria, inter alia, in a human as the result of an infection, in contaminated 
5 biological samples and specimens, in foodstuffs and water supplies, or in any 

substance that may come in to contact with the human. Specific hybridization will 
be understood to mean that the nucleic acid probes of the invention are capable of 
forming stable, specific hybridization to bacterially-derived DNA or RNA under 
conditions of high stringency, as the term "high stringency" would be understood by 

10 those with skill in the art (see, for example, Sambrook et al. 9 1989, Molecular 

Cloning: A Laboratory Manual . Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. and Hames and Higgins, eds., 1985, Nucleic Acid Hybridization . IRL 
Press, Oxford, U.K.). Hybridization will be understood to be accomplished using 
well-established techniques, including but not limited to Southern blot hybridization, 

15 Northern blot hybridization, in situ hybridization and Southern hybridization to 

polymerase chain reaction product DNAs. The invention will thus be understood to 
provide oligonucleotides, specifically, pairs of oligonucleotides, for use as primers 
in support of in vitro amplification of bacterial hemoglobin receptor genes and 
mRNA transcripts. 

20 The production of proteins such as bacterial hemoglobin receptor proteins 

from cloned genes by genetic engineering means is well known in this art. The 
discussion which follows is accordingly intended as an overview of this field, and is 
not intended to reflect the full state of the art. It will be understood from the 
following discussion that the hemoglobin receptor protein genes of this invention are 

25 particularly advantageous, since expression of such proteins by bacteria, including 

non-Neisseria species of bacteria, can complement certain auxotrophic mutants of 
said transformed bacteria otherwise unable to subsist absent supplementation of the 
growth media with iron (ID). 

DNA encoding a bacterial hemoglobin receptor protein, in view of the instant 

30 disclosure, by chemical synthesis, by screening reverse transcripts of mRNA from 

appropriate cells, by screening genomic libraries from appropriate cells, or by 
combinations of these procedures, as illustrated below. Screening of mRNA or 

-20- 

BNSDOCID:<WO 9612020A3_IA> 



WO 96/12020 




PCT/US95/13623 



genomic DNA may be carried out with oligonucleotide probes generated from the 
nucleic acid sequence information from the bacterial hemoglobin receptor protein 
disclosed herein. Probes may be labeled with a detectable group such as a 
fluorescent group, a radioactive atom or a chemiluminescent group in accordance 
5 with know procedures and used in conventional hybridization assays, as described 

in greater detail in the Examples below. In the alternative, bacterial hemoglobin 
receptor protein-encoding nucleic acids may be obtained by use of the polymerase 
chain reaction (PGR) procedure, using appropriate pairs of PCR oligonucleotide 
primers corresponding to nucleic acid sequence information derived from a bacterial 

10 hemoglobin receptor protein as provided herein. See U.S. Patent Nos. 4,683,195 
to Mullis et aL and 4,683,202 to Mullis, as specifically disclosed herein in Example 
9 below. In another alternative, such bacterial hemoglobin receptor protein-encoding 
nucleic acids may be isolated from auxotrophic cells transformed with a bacterial 
hemoglobin receptor protein gene, thereby relieved of the nutritional requirement for 

15 uncomplexed iron (III). 

Any bacterial hemoglobin receptor protein of the invention may be 
synthesized in host cells transformed with a recombinant expression construct 
comprising a nucleic acid encoding the bacterial hemoglobin receptor protein. Such 
recombinant expression constructs can also be comprised of a vector that is a 

20 replicable DNA construct. Vectors are used herein either to amplify DNA encoding 
a bacterial hemoglobin receptor protein and/or to express DNA encoding a bacterial 
hemoglobin receptor protein. For the purposes of this invention, a recombinant 
expression construct is a replicable DNA construct in which a nucleic acid encoding 
a bacterial hemoglobin receptor protein is operably linked to suitable control 

25 sequences capable of effecting the expression of the bacterial hemoglobin receptor 
protein in a suitable host cell. 

The need for such control sequences will vary depending upon the host cell 
selected and the transformation method chosen. Generally, bacterial control 
sequences include a transcriptional promoter, an optional operator sequence to 

30 control transcription, a sequence encoding suitable mRNA ribosomal binding sites 
(the Shine-Delgarno sequence), and sequences which control the termination of 
transcription and translation. Amplification vectors do not require expression control 

-21 - 



BNSDOCID: <WO 9612020A3JA> 



WO 96/12020 




PCT/US95/13623 



domains. All that is needed is the ability to replicate in a host, usually conferred by 
an origin of replication, and a selection gene to facilitate recognition of 
transformants. See, Sambrook et al. f 1989, ibid. 

Vectors useful for practicing the present invention include plasmids and virus- 
5 derived constructs, including phage and particularly bacteriophage, and integratable 

DNA fragments (i.e., fragments integratable into the host genome. by homologous 
recombination). The vector replicates and functions independently of the host 
genome, or may, in some instances, integrate into the genome^ itself . Suitable 
vectors will contain replicon and control sequences which are derived from species 

10 compatible with the intended expression host. A preferred vector is pLAFR2 (see 

Riboli et aL y 1991, Microb. Pathogen. 10: 393-403). 

Transformed host cells are cells which have been transformed or transfected 
with recombinant expression constructs made using recombinant DNA techniques and 
comprising nucleic acid encoding a bacterial hemoglobin receptor protein. Preferred 

15 host cells are cells of Neisseria species, particularly N. meningitidis, as well as 

Salmonella typhi and Salmonella typhimurium species, and Escherichia coli 
auxotrophic mutant cells (hemA aroB). Transformed host cells may express the 
bacterial hemoglobin receptor protein, but host cells transformed for purposes of 
cloning or amplifying nucleic acid hybridization probe DNA need not express the 

20 receptor protein. When expressed, the bacterial hemoglobin receptor protein of the 

invention will typically be located in the host cell outer membrane. See, Sambrook 
et al. , ibid. 

Cultures of bacterial cells, particularly cells of Neisseria species, and certain 
E. coli mutants, are a desirable host for recombinant bacterial hemoglobin receptor 

25 protein synthesis. In principal, any bacterial ceil auxotrophic for uncomplexed iron 

(III) is useful for selectively growing bacterial hemoglobin receptor protein- 
transformed cells. However, for this purpose, well-characterized auxotrophs, such 
as E. coli hemA aroB mutants are preferred. 

The invention provides homogeneous compositions of a bacterial hemoglobin 

30 receptor protein produced by transformed cells as provided herein. Each such 

homogeneous composition is intended to be comprised of a bacterial hemoglobin 
receptor protein that comprises at least 90% of the protein in such a homogenous 
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composition. The invention also provides membrane preparations from cells 
expressing a bacterial hemoglobin receptor protein as the result of transformation 
with a recombinant expression construct of the invention, as described herein. 

Bacterial hemoglobin receptor proteins, peptide fragments thereof and 
5 membranes derived from cells expressing such proteins in accordance with the 

present invention may be used for the production of vaccines effective against 
bacterial infections in a human, with pathogenic microorganisms expressing such 
bacterial hemoglobin receptor proteins. Such vaccines preferably would be effective 
in raising an immunological response against bacteria of Neisseria species, most 

10 preferably AT. meningitidis and N. gonorhoeae. Also encompassed within the 

vaccines provided by the invention are recombinant expression constructs as 
disclosed herein useful per se as vaccines, for introduction into an animal and 
production of an immunologic response to bacterial hemoglobin receptor protein 
antigens encoded therein. 

15 Preparation of vaccines which contain polypeptide or polynucleotide 

sequences as active ingredients is well understood in the art. Typically, such 
vaccines are prepared as injectables, either as liquid solutions or suspensions. 
However, solid forms suitable for solution in, or suspension in, liquid prior to 
injection may also be prepared. The preparation may also be emulsified. The active 

20 immunogenic ingredient is often mixed with excipients which are pharmaceutically 

acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations 
thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants 

25 which enhance the effectiveness of the vaccine. The vaccines are conventionally 

administered parenterally, by injection, for example, either subcutaneously or 
intramuscularly. Additional formulations which are suitable for other modes of 
administration include suppositories and, in some cases, oral formulations. For 
suppositories, traditional binders and carriers may include, for example, polyalkalene 

30 glycols or triglycerides; such suppositories may be formed from mixtures containing 

the active ingredient in the range of 0.5% to 10%, preferably 1 to 2%. Oral 
formulations include such normally employed excipients as, for example, 
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pharmaceutical grades of manitol, lactose, starch, magnesium s tear ate, sodium 
saccharine, cellulose, magnesium carbonate and the like. These compositions take 
the form of solutions, suspensions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10% to 95% of active ingredient, preferably 25 
5 to 70%. 

The polypeptides of the invention may be formulated into the vaccine as 
neutral or salt forms. Pharmaceutically acceptable salts, include the acid additional 
salts (formed with the free amino groups of the peptide) and which are formed with 
inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
10 organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with 

the free carboxyl groups may also be derived from inorganic bases such as, for 
example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such 
organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 
procaine, and the like. 

15 In another embodiment, such vaccines are provided wherein the bacterial 

hemoglobin receptor proteins or peptide fragments thereof are present in the intact 
cell membranes of cells expressing such proteins in accordance with the present 
invention. In preferred embodiments, cells useful in these embodiments include 
attenuated varieties of cells adapted to growth in humans. Most preferably, said cells 

20 are attenuated varieties of cells adapted for growth in humans, i.e., wherein such 

cells do not cause frank disease or other pathological conditions, such as bactermia, 
endotoxemia or sepsis. For the purposes of this invention, "attenuated" cells will be 
understood to encompass prokaryotic and eukaryotic cells that do not cause infection, 
disease, septicemia, endotoxic shock, pyrogenic shock, or other serious and adverse 

25 reactions to administration of vaccines to an animal, most preferably a human, when 

such cells are introduced into the animal, whether such cells are viable, living, heat-, 
chemically- or genetically attenuated or inactivated, or dead. It will be appreciated 
by those with skill in this art that certain minor side-effects of vaccination, such as 
short-term fever, muscle discomfort, general malaise, and other well-known reactions 

30 to vaccination using a variety of different types of vaccines, can be anticipated as 

accompanying vaccination of an animal, preferably a human, using the vaccines of 
the invention. Such acute, short-term and non-life-threatening side effects are 
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encompassed in the instant definition of the vaccines of the invention, and vaccines 
causing such side-effects fall within the definition of "attenuated" presented herein. 
Preferred examples of such attenuated cells include attenutated varieties of 
Salmonella species, preferably Salmonella typhi and Salmonella typhimurium, as well 
5 as other attenuated bacterial species. It will be specifically understood that these 

embodiments of the vaccines of the invention encompass so-called "live" attenuated 
cell preparations as well as heat* or chemically-inactivated cell preparations. 

In other embodiments of the invention are provided vaccines that are DNA 
vaccines, comprising the nucleic acids of the invention in recombinant expression 

10 constructs competant to direct expression of hemoglobin receptor proteins when 
introduced into an animal. In preferred embodiments, such DNA vaccines comprise 
recombinant expression constructs wherein the hemoglobin receptor-encoding nucleic 
acids of the invention are operably linked to promoter elements, most preferably the 
early gene promoter of cytomegalovirus or the early gene promoter of simian virus 

15 40. DNA vaccines of the invention are preferably administered by intramuscular 

injection, but any appropriate route of administration, including oral, transdermal, 
rectal, nasal, aerosol administration into lung, or any other clinically-acceptable route 
of administration can be used by those with skill in the art. 

In general, the vaccines of the invention are administered in a manner 

20 compatible with the dosage formulation, and in such amount as will be 

therapeutically effective and immunogenic. The quantity to be administered depends 
on the subject to be treated, capacity of the subject's immune system to synthesize 
antibodies, and the degree of protection desired. Precise amounts of active 
ingredient required to be administered depend on the judgment of the practitioner and 

25 are peculiar to each individual. However, suitable dosage ranges are of the order 

of several hundred micrograms active ingredient per individual. Suitable regimes for 
initial administration and booster shots are also variable, but are typified by an initial 
administration followed in one or two week intervals by a subsequent injection or 
other administration. 

30 The recombinant expression constructs of the present invention are also useful 

in molecular biology to transform bacterial cells which do not ordinarily express a 
hemoglobin receptor protein to thereafter express this receptor. Such cells are 
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useful, inter alia, as intermediates for making cell membrane preparations useful for 
receptor binding activity assays, vaccine production, and the like, and in certain 
embodiments may themselves be used, inter alia, as vaccines or components of 
vaccines, as described above. The recombinant expression constructs of the present 
5 invention thus provide a method for screening potentially useful bactericidal and 

bacteriostatic drugs at advantageously lower cost than conventional screening 
protocols. While not completely eliminating the need for ultimate in vivo activity 
and toxicology assays, the constructs and cultures of the invention provide an 
important first screening step for the vast number of potentially useful bactericidal 

10 and bacteriostatic drugs synthesized, discovered or extracted from natural sources 

each year. In addition, such bactericidal or bacteriostatic drugs would be selected 
to utilize a nutritional pathway associated with infectious virulence in these types of 
bacteria, as disclosed in more detail below, thus selectively targeting bacteria 
associated with the development of serious infections in vivo. 

15 Also, the invention provides both functional bacterial hemoglobin receptor 

proteins, membranes comprising such proteins, cells expressing such proteins, and 
the amino acid sequences of such proteins. This invention thereby provides sufficient 
structural and functional activity information to enable rational drug design of novel 
therapeutically-active antibacterial drugs using currently-available techniques (see 

20 Walters, "Computer- Assisted Modeling of Drugs", in Klegerman & Groves, eds., 

1993, Pharmaceutical Biotechnology . Interpharm Press: Buffalo Grove, IL, pp. 165- 
174). 

Nucleic acids and oligonucleotides of the present invention are useful as 
diagnostic tools for detecting the existence of a bacterial infection in a human, caused 

25 by a hemoglobin receptor protein-expressing pathological organism of Neisseria 

species. Such diagnostic reagents comprise nucleic acid hybridization probes of the 
invention and encompass paired oligonucleotide PCR primers, as described above. 
Methods provided by the invention include blot hybridization, in situ hybridization 
and in vitro amplification techniques for detecting the presence of pathogenic bacteria 

30 in a biological sample. Appropriate biological samples advantageously screened 

using the methods described herein include plasma, serum, lymph, cerebrospinal 
fluid, seminal fluid, mucosal tissue samples, biopsy samples, and other potential sites 
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of bacterial infection. It is also envisioned that the methods of the invention may be 
used to screen water, foodstuffs, pharmaceuticals, and other potential sources of 
infection. 

The invention also provides antibodies that are immunologically reactive to 
5 a bacterial hemoglobin receptor protein or epitopes thereof provided by the 
invention. The antibodies provided by the invention may be raised, using methods 
well known in the art* in animals by inoculation with cells that express a bacterial 
hemoglobin receptor protein or epitopes thereof, cell membranes from such cells, 
whether crude membrane preparations or membranes purified using methods well 

10 known in the art, or purified preparations of proteins, including fusion proteins, 

particularly fusion proteins comprising epitopes of a bacterial hemoglobin receptor 
protein of the invention fused to heterologous proteins and expressed using genetic 
engineering means in bacterial, yeast or eukaryotic cells, said proteins being isolated 
from such cells to varying degrees of homogeneity using conventional biochemical 

15 means. Synthetic peptides made using established synthetic means in vitro and 

optionally conjugated with heterologous sequences of amino acids, are also 
encompassed in these methods to produce the antibodies of the invention. Animals 
that are used for such inoculations include individuals from species comprising cows, 
sheep, pigs, mice, rats, rabbits, hamsters, goats and primates. Preferred animals for 

20 inoculation are rodents (including mice, rats, hamsters) and rabbits. The most 
preferred animal is the mouse. 

Cells that can be used for such inoculations, or for any of the other means 
used in the invention, include any cell that naturally expresses a bacterial hemoglobin 
receptor protein as provided by the invention, or any cell or cell line that expresses 

25 a bacterial hemoglobin receptor protein of the invention, or any epitope thereof, as 
a result of molecular or genetic engineering, or that has been treated to increase the 
expression of an endogenous or heterologous bacterial hemoglobin receptor protein 
by physical, biochemical or genetic means. Preferred cells are E. coli auxotrophic 
mutant hemA aroB cells transformed with a recombinant expression construct of the 

30 invention and grown in media supplemented with hemin or hemoglobin as the sole 
iron (III) source, and cells of Neisseria species. 
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The present invention also provides monoclonal antibodies that are 
immunologically reactive with an epitope of a bacterial hemoglobin receptor protein 
of the invention, or fragment thereof, present on the surface of such cells, preferably 
E. coli cells. Such antibodies are made using methods and techniques well known 
5 to those of skill in the art. Monoclonal antibodies provided by the present invention 

are produced by hybridoma cell lines, that are also provided by the invention and 
that are made by methods well known in the art {see Harlow and Lane, 1988, 
Antibodies: A Labor atory Manual . Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y.). 

10 Hybridoma cell lines are made by fusing individual cells of a myeloma cell 

line with spleen cells derived from animals immunized with a homogeneous 
preparation of a bacterial hemoglobin receptor protein, membranes comprised 
thereof, cells expressing such protein, or epitopes of a bacterial hemoglobin receptor 
protein, used per se or comprising a heterologous or fusion protein construct, as 

15 described above. The myeloma cell lines used in the invention include lines derived 

from myelomas of mice, rats, hamsters, primates and humans. Preferred myeloma 
cell lines are from mouse, and the most preferred mouse myeloma cell line is 
P3X63-Ag8.653. The animals from whom spleens are obtained after immunization 
are rats, mice and hamsters, preferably mice, most preferably Balb/c mice. Spleen 

20 cells and myeloma cells are fused using a number of methods well known in the art, 

including but not limited to incubation with inactivated Sendai virus and incubation 
in the presence of polyethylene glycol (PEG). The most preferred method for cell 
fusion is incubation in the presence of a solution of 45% (w/v) PEG- 1450. 
Monoclonal antibodies produced by hybridoma cell lines can be harvested from cell 

25 culture supernatant fluids from in vitro cell growth; alternatively, hybridoma cells 

can be injected subcutaneousiy and/or into the peritoneal cavity of an animal, most 
preferably a mouse, and the monoclonal antibodies obtained from blood and/or 
ascites fluid. 

Monoclonal antibodies provided by the present invention are also produced 
30 by recombinant genetic methods well known to those of skill in the art, and the 

present invention encompasses antibodies made by such methods that are 
immunologically reactive with an epitope of a bacterial hemoglobin receptor protein 
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of the invention. The present invention also encompasses fragments, including but 
not limited to F(ab) and F(ab)' 2 fragments, of such antibody. Fragments are 
produced by any number of methods, including but not limited to proteolytic 
cleavage, chemical synthesis or preparation of such fragments by means of genetic 
5 engineering technology. The present invention also encompasses single-chain 
antibodies that are immunologically reactive with an epitope of a bacterial 
hemoglobin receptor protein, made by methods known to those of skill in the art. 

The antibodies and fragments used herein can be labeled preferably with 
radioactive labels, by a variety of techniques. For example, the biologically active 

10 molecules can also be labeled with a radionucleotide via conjugation with the cyclic 
anhydride of diethylenetriamine penta-acetic acid (DPTA) or bromoacetyl 
aminobenzyl ethylamine diamine tetra- acidic acid (BABE). See Hnatowich et al. 
(1983, Science 22^: 613-615) and Meares et al. (1984, Anal Biochem. 142: 68-78, 
both references incorporated by reference) for further description of labeling 

15 techniques. 

The present invention also encompasses an epitope of a bacterial hemoglobin 
receptor protein of the invention, comprised of sequences and/or a conformation of 
sequences present in the receptor molecule. This epitope may be naturally occurring, 
or may be the result of proteolytic cleavage of a receptor molecule and isolation of 

20 an epitope-containing peptide or may be obtained by synthesis of an epitope- 
containing peptide using methods well known to those skilled in the art. The present 
invention also encompasses epitope peptides produced as a result of genetic 
engineering technology and synthesized by genetically engineered prokaryotic or 
eukaryotic cells. 

25 The invention also includes chimeric antibodies, comprised of light chain and 

heavy chain peptides immunologically reactive to a bacterial hemoglobin receptor 
protein-derived epitope. The chimeric antibodies embodied in the present invention 
include those that are derived from naturally occurring antibodies as well as chimeric 
antibodies made by means of genetic engineering technology well known to those of 

30 skill in the art. 

Also provided by the present invention are diagnostic and therapeutic methods 
of detecting and treating an infection in a human, by a pathogenic organisms 
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expressing a bacterial hemoglobin receptor protein. Diagnostic reagents for use in 
such methods include the antibodies, most preferably monoclonal antibodies, of the 
invention. Such antibodies are used in conventional immunological techniques, 
including but not limited to enzyme-linked immunosorbent assay (ELISA), 
5 radio immune assay (RIA), Western blot assay, immunological titration assays, 

immunological diffusion assays (such as the Ouchterlony assay), and others known 
to those of skill in the art. Also provided are epitopes derived from a bacterial 
hemoglobin receptor protein of the invention and immunologically cross-reactive to 
said antibodies, for use in any of the immunological techniques described herein. 

10 Additional diagnostic assays include nucleic acid hybridization assays, using 

the nucleic acids of the invention or specifically-hybridizing fragments thereof, for 
sensitive detection of bacterial genomic DNA and/or mRNA. Such assays include 
various blot assays, such as Southern blots, Northern blots, dot blots, slot blots and 
the like, as well as in vitro amplification assays, such as the polymerase chain 

15 reaction assay (PCR), reverse transcriptase-polymerase chain reaction assay (RT- 

PCR), ligase chain reaction assay (LCR), and others known to those skilled in the 
art. Specific restriction endonuclease digestion of diagnostic fragments detected 
using any of the methods of the invention, analogous to restriction fragment linked 
polymorphism assays (RFLP) are also within the scope of this invention. 

20 The invention also provides therapeutic methods and reagents for use in 

treating infections in a human, cause by a microorganism expressing a bacterial 
hemoglobin receptor protein of the invention, most preferably a bacteria of Neisseria 
species. Therapeutic reagents for use in such methods include the antibodies, most 
preferably monoclonal antibodies, of the invention, either per se or conjugated to 

25 bactericidal or bacteriostatic drugs or other antibiotic compounds effective against the 

infectious microorganism. In such embodiments, the antibodies of the invention 
comprise pharmaceutical compositions, additionally comprising appropriate 
pharmaceutically-acceptable carriers and adjuvants or other ancillary components 
where necessary. Suitable carriers are, for example, water, saline, dextrose, 

30 glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the 

pharmaceutical formulation may contain minor amounts of auxiliary substances such 
as wetting or emulsifying agents, pH buffering agents, or other compounds which 
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enhance the effectiveness of the antibody. In these embodiments, it will be 
understood that the therapeutic agents of the invention serve to target the infectious 
bacteria, either by immunologically "tagging" the bacteria with an antibody of the 
invention for recognition by cytotoxic cells of a human's immune system, or by 
5 specifically delivering an antimicrobial drug to the infectious microorganism via the 

bacterial hemoglobin receptor protein. 

Additional therapeutic reagents include the nucleic acids of the invention or 
fragments thereof, specifically antisense embodiments of such nucleic acids. Such 
antisense nucleic acids may be used themselves or embodied in a recombinant 

10 expression construct specific for antisense expression, wherein said construct is 
genetically engineered to co-opt a portion of the genome of a bacterial virus, 
preferably a bacteriophage, infectious for the bacterial pathogen responsible for the 
infection. In these embodiments, introduction of the antisense nucleic acids of the 
invention into the bacterial cell inhibits, attentuates or abolishes expression of the 

15 bacterial hemoglobin receptor, thereby reducing the virulence of the bacterial 

infection and enabling more effective antibacterial interventions. In additional 
embodiments, bacteriophage are provided bearing "knockout" copies of a bacterial 
hemoglobin receptor gene, whereby the phage achieves genetic mutation of the 
endogenous hemoglobin receptor gene in the infectious bacteria via, for example, 

20 homologous recombination of the exogenous knockout copy of the bacterial 
hemoglobin receptor gene with the endogenous hemoglobin receptor gene in the 
infectious microorganism. 

The Examples which follow are illustrative of specific embodiments of the 
invention, and various uses thereof. They set forth for explanatory purposes only, 

25 and are not to be taken as limiting the invention. 

EXAMPLE 1 
Masmids. bacteria, and media 
Plasmids and bacteria used herein are listed on Table 1. E. coli strains were 
30 routinely grown in Luria-Bertani (LB) broth supplemented with 5-aminolevulinic acid 

and 50mg/L hemin chloride as necessary. N. meningitidis 8013 is a serogroup C 
clinical isolate (Nassif et al. , 1993, Mol Microbiol. 8: 719-725). The meningococci 
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were routinely grown on GCB agar (Difco) supplemented as described by Kellogg 
et aL (1963, J. Bacteriol 85: 1274-1279), and incubated at 37°C under a 5% C0 2 
atmosphere. Transformation of meningococci was performed as described by Nassif 
etaL (1992, MoL Microbiol. 6: 591-597). When necessary, the following antibiotics 
5 were used with E. coli: rifampicin, 100 mg/L; tetracycline, 15 mg/L; kanamycin, 

30 mg/L; chloramphenicol, 20 mg/L; carbenicillin, 100 mg/L. For Neisseriae, 
kanamycin at 100 mg/L was used when needed. 

EXAMPLE 2 

10 Auxotroph Complementation Cloning of a hemoglobin Receptor Gene from 

Neisseria meningitidis 

In order to identify TV. meningitidis outer membrane receptor(s) involved in 

the uptake of haemin and/or haemoglobin iron, an auxotroph complementation 

cloning strategy was used, similar to the approach previously taken to identify the 

15 Y. enterocolitica and V. cholerae hemin receptors {see Stojiljkovic and Hantke, 1992, 

EMBO J. 11: 4359^367; Henderson and Payne, 1994, 7. Bacteriol. 176: 3269- 
3277). This strategy is based on the fact that the outer membrane of Gram-negative 
bacteria is impermeable to hemin (McConville and Charles, 1979, J. Microbiol. 113 : 
165-168) and therefore E. coli porphyrin biosynthesis mutants cannot grow on 

20 exogenously supplied hemin. If provided with the N. meningitidis outer membrane 

hemin receptor gene, the E. coli porphyrin mutant would be able to use exogenously 
supplied hemin as its porphyrin source. 

A cosmid bank of N. meningitidis 8013 clone 6 DNA was prepared using 
conventional cosmid cloning methodologies (Sambrook et aL, 1989, ibid.). N. 

25 meningitidis bacterial DNA was partially digested by Mbol, size fractionated on 

sucrose gradients and cloned into the BamUl site of the cosmid vector pLAFR2 
(Riboli et aL, 1991, Microb. Pathogen. 10: 393-403). This cosmid bank was 
mobilized into the E. coli hemA aroB Rif r recipient strain by uniparental matings 
using a conjugal plasmid pRK2013: :Tn9. The mating mixture was plated onselective 

30 plates containing hemin chloride (50mg/L), 0.1 mM 2,2'-dypyridil and rifampicin 

(100 mg/L). Several clones growing on exogenously supplied haemin were isolated 
after an overnight incubation. 
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TABLE I 



STRAIN 


GENOTYPE 


E. coli K12 




EB53 


hemA, aroB, rpoB 


KP1041 


MC4lQQtonB::Km r 


H1388 


exbB::TnJ0 Alac pro 


TSM348 


endA, hsdR, pro, supF, pRK2013::Tn9 


IR754 


EB53. tonB • Knf 


IR736 


EB53 exhR- Tn/rt 


DH5a 




N. meningitidis 




ATCC 13077 


Serotype A 




Serotype B* 


MC8013 


clone 6, wild type 


MChmbR 


hmbR::aphA-3 



N. gonorrhoeae MS11A 



PLASMIDS 




pSUSK 


pAlS replicon, chloramphenicol' 


pHEM22 


pLAFR2, hemoglobin-utilizing cosmid 


pHEM44 


pLAFR2, hemin-utilizing cosmid 


pIRS508 


6kb Clal, pSUSK 


pIRS523 


3kb BamHI/Saa, pUC19 


pIRS525 


1.2kb aphA-3, in Notl site of pIRS523 


pIRS527 


4kb BamHVClal, pBluescript 


pIRS528 


0.7kb NotUBamHL, pBluescript 


pIRS692 


3.3kb BamHVHindm, SU(SK) 



* Laboratory collection 



- 33 - 



BNSOOCID: <WO 9612020A3JA> 



WO 96/12020 



• 



PCT/US95/13623 



The hemin utilization phenotype of these transformants was tested by re- 
introduction of the cosmids into naive E. coli hemA aroB cells and by monitoring the 
growth on hemin-supplemented plates. The ability of E. coli strains to utilize heme 
or hemoglobin as the sole iron source was tested as previously described (Stojiljkovic 
5 and Hantke, 1992, ibid,). Cells were grown on LB agar supplemented with 50/xM 

deferoxamine mesylate (an iron chelating agent, obtained from Sigma Chemical Co., 
St. Louis, MO). Filter discs (1/4 inches, Schleichner & Schuell, Inc., Keene, NH.) 
impregnated with the test compounds (20 /xL of 5 mg/ml stock solutions unless 
otherwise stated) were placed on these plates. After overnight growth at 37°C with 

10 5% C0 2 , zones of growth around the discs were monitored. The iron-bound proteins 

tested in this assay (all obtained from Sigma Chemicals Co.) were hemoglobin from 
human, baboon, bovine and mouse sources, bovine hemin, human lactoferrin (90% 
iron saturated), and human transferrin (90% iron saturated, obtained from Boehringer 
Mannheim Biochemicals, Indianapolis, IN). A total of six hemin utilization positive 

15 cosmids were obtained using this protocol. Results using such assays are shown in 

Table II. 

EXAMPLE 3 

Restriction Enzyme Digestion Mapping of Hemin Utilization 
20 Positive Cosmids 

Cosmid DNA from six hemin-utilization positive cosmids obtained as 

described in Example 2 were digested with Clal, and the resulting fragments were 

cloned into Ctel-digested pSU(SK) vector (obtained from Stratagene, LaJolla, CA). 

One subclone, containing a 6 kb Clal fragment from cosmid cos22 (the resultant 

25 plasmid was designated pIRS508), was determined to allow utilization of hemin and 

hemoglobin by E. coli hemA aroB assayed as described in Example 2. Another such 
clone, containing an 11 kb Clal fragment from cos44 was also determined to allow 
hemin utilization in these auxotrophic mutant cells. Restriction analysis and Southern 
hybridization indicated that the DNA fragments originating from cos22 and cos44 are 

30 unrelated. 

The deduced restriction enzyme digestion map of cosmid clone pIRS508 is 
shown in Figure 1 . Plasmid pIRS508 enabled E, coli hemA aroB to use both hemin 
and bovine hemoglobin as iron sources although growth on hemoglobin was 
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somewhat weaker than on hemin (Table II). Further subcloning localized the 
hemin/hemoglobin utilization locus to the BamUVHindni fragment of the insert. In 
addition to sequences encoding the hemoglobin receptor gene (designated hmbR), 
sequences for a Neisseria insertion element (IS1106) and a portion of a Neisseria 
5 small repetitive element (IRi) are also represented in the Figure. 

EXAMPLE 4 

Nucleotide Sequence Analysis of a Cosmid Clone Encoding 
a Neisseria Hemoglobin Receptor Gene 

10 The nucleotide sequence of the 3.3 kb BamHl-Hindni DNA fragment 

carrying the hmbR gene and its promoter region was determined using the dideoxy 
chain termination method using a Sequenase 2.0 kit (obtained from U.S. 
Biochemicals, Cleveland, OH) and analyzed using a BioRad electrophoresis system, 
an AutoRead kit (obtained from Pharmacia, Uppsala, SE) and an ALF-370 automatic 

15 sequenator (Pharmacia, Uppsala, Sweden). Plasmid subclones for sequencing were 

produced by a nested deletion approach using Erase-a-Base kit (obtained from 
Promega Biotech, Madison, WI) using different restriction sites in the hmbR gene. 
The nucleotide and predicted amino acid sequences of the hmbR gene are shown in 
Figure 2 

20 An open reading frame (ORF) encoding the N. meningitidis, serotype C 

hemoglobin receptor protein begins at position 470 of the sequence and encodes a 
protein having an amino acid sequence of 792 amino acids, with a calculated 
molecular weight of 85.5 kDa. A Shine-Delgarno sequence (SD) is found at position 
460. The HmbR receptor protein contains a signal peptidase I recognition sequence 

25 at residues 22 to 24 of the protein (underlined) , consistent with the fact that it is an 

outer membrane protein. 

A typical Fur binding nucleotide sequence (designated "Fur box") was found 
in the promoter region of the hmbR gene (Figure 2). Like hemin utilization in 
Yersiniae and Vibrio, hemin and hemoglobin utilization in Neisseria are known to be 

30 iron-inducible phenotypes (West and Sparling, 1985, Infect. Immun. 47: 388-394; 

Dyer et al. t 1987, Infect. Immun, 55: 2171-2175). In Gram-negative bacteria, 
conditional expression of many iron utilization genes is regulated by the Fur 
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repressor, which recognizes a 19 bp imperfect dyad repeat (Fur-box) in the promoter 
regions of Fur-repressed genes. Recently, a genetic screen (FURTA) for the 
identification of Fur-regulated genes from different Gram-negative bacteria was 
described (Stojiljkovic et al. y 1994, 7. MoL Biol. 23g: 531-545), and this assay was 
5 used to test whether hmbR expression was controlled in this way. Briefly, a plasmid 

carrying a Fur-box sequence is transformed into an E. coli strain (H1717) which 
possesses a Fur-regulated lac fusion in the chromosome. Expression of this Fur- 
regulated lac fusion is normally repressed. Introduction of a multicopy Fur-box 
sequence on the plasmid titrates the available Fur repressor thus allowing expression 

10 of the Fur-regulated lac fusion (this phenotype is termed FURTA positive). Using 
this screen, the smallest insert fragment from cosmid pERS508 that produced a 
FURTA positive result was a 0.7 kb BamHI-Notl DNA fragment carried on plasmid 
pIRS528 (see Figure 1). This result indicated that the 0.7 kb BamHl-Nofi fragment 
carries a Fur-box and that gene expression from the hmbR promoter is controlled by 

15 a fur-type operon. 

N. meningitidis, serotype C hemoglobin receptor protein was expressed in 
vitro using an E. coli S30 extract system from Promega Biotech (Madison, WI). The 
3.3 kb BamHl-HindUl fragment, expressed in vitro, encoded a 90kDa protein which 
corresponds in size to the predicted molecular weight of the unprocessed HmbR 

20 receptor. SDS/ 10% PAGE analysis showing the observed M r of 90K is shown in 

Figure 3. 

Immediately downstream of the hmbR gene (at positions 2955 to 3000 bp in 
Figure 2) was found a short nucleotide sequence that is 99% identical to the flanking 
sequence of the PHI gene of N. gonorrhoeae (Gotschlich et aL , 1987, 7. Exp. Med. 

25 16^: 471-482). The first 26 bp of this sequence represents one half of the inverted 

repeat (IR1) of the N. gonorrhoeae small repetitive element. This element is found 
in approximately 20 copies in both N. gonorrhoeae and N. meningitidis (Correia et 
al., 1988, J. Biol Chem. 263: 12194-12198). The analysis of the nucleotide 
sequence from position 3027 to the Clal (3984) restriction site (only the nucleotide 

30 sequence from BamHl (1) to //mdm (3370) is shown in Figure 2) indicated the 

presence of an IS1106 element (Knight et al. , 1992, MoL Microbiol. 6: 1565-1573). 
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Interestingly, no nucleotide sequence similar to the IS 11 06 inverted repeat was found 
between the IR1 element and the beginning of the homology to IS 1106. 

These results were consistent with the cloning and identification of a novel 
hemoglobin receptor protein gene from N. meningitidis, embodied in a 3.3kb 
5 BamHL/HindHl fragment of N. meningitidis genomic DNA. 

EXAMPLE 5 

Amino Acid Sequence Comparison of the N. meningitidis 
Hemoglobin Receptor Protein and Neisseria 
10 Lactoferrin and Transferrin Receptor Proteins 

A comparison of the transferrin (Tbpl; Legrain et aL, 1993, Gene 130: 81- 

90), lactoferrin (LbpA; Pettersson etaL, 1993, Infect. Immun. 61: 4724-4733, and 

1994, 7. BacterioL 176: 1764-1766) and hemoglobin receptors (HmbR) from N. 

meningitidis is shown in Figure 4. The comparison was done with the CLASTAL 

15 program from the PC/GENE program package (Intelligenetics, Palo Alto, CA). 

Only the ammo-terminal and carboxyl terminal segments of the proteins are shown. 
An asterisk indicates identity and a point indicates similarity at the amino acid level. 
Lactoferrin and transferrin receptors were found to share 44.4% identity in amino 
acid sequence. In contrast, homology between these proteins and the hemoglobin 

20 receptor disclosed herein was found to be significantly weaker (22% amino acid 

sequence identity with lactoferrin and 21% with transferrin receptor). 

EXAMPLE 6 

TonB/ExbBD-Dependence of Hemin Transport by the N. meningitidis 
25 Hemoglobin Receptor 

It was known that the transport of iron-containing siderophores, some colicins 

and vitamin B12 across the outer membrane of £. coli depends on three cytoplasmic 

membrane proteins: TonB, ExbB and ExbD (Postle 1990, Mol. MicrobioL 133: 891- 

898; Braun and Hantke, 1991, in Winkelmann, (ed.), Handbook of Microbial Iron 

30 Chelates . CRC Press, Boca Raton, Fla., pp. 107-138). In Yersinia and Hemophilus, 

hemin uptake was shown to be a TonB-dependent process (Stojiljkovic and Hantke, 

1992, ibid.; Jarosik et aL, 1994, Infect. Immun. 62: 2470-2477). Through direct 

interaction between the outer membrane receptors and the TonB cytoplasmic 
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10 



15 



20 



25 



machinery, the substrate bound to the receptor is internalized into the periplasm 
(Heller et aL, 1988, Gene 64: 147-153; Schoffler and Braun, 1989, Molec. Gen. 
Genet. 217 : 378-383). This direct interaction has been associated with a particular 
amino acid sequence in membrane proteins associated with the TonB machinery. 

All TonB-dependent receptors in Gram-negative bacteria contain several 
regions of high homology in their primary structures (Lundrigan and Kadner, 1986, 
J. Biol. Chem. 261: 10797-10801). In the amino acid sequence comparison 
described in Example 5, putative TonB-boxes of all three proteins are underlined. 
The carboxyl terminal end of the HmbR receptor contains the highly conserved 
terminal phenylalanine and position 782 arginine residues thought to be part of an 
outer membrane localization signal (Struyve et al. , 1991 , 7. MoL Biol. 218 : 141-148; 
Koebnik, 1993, Trends Microbiol 1: 201). At residue 6 of the mature HmbR 
protein, an amino acid sequence - ETTPVKA - is similar in sequence to the so called 
TonB-boxes of several Gram-negative receptors (Heller et aL 9 1988, ibid.). 
Interestingly, the putative TonB-box of HmbR has more homology to the TonB-box 
of the N. gonorrhoeae transferrin receptor (Cornelissen et al. , 1992, /. BacterioL 
174 : 5788-5797) than to the TonB-boxes of E. coli siderophore receptors. When the 
sequence of the HmbR receptor was compared with other TonB-dependent receptors, 
the highest similarity was found with Y. enterocolitica HemR receptor although the 
similarity was not as high as to the Neisseria receptors. 

In order to prove the TonB-dependent nature of the N. meningitidis, serotype 
C hemoglobin receptor, hmbR was introduced into exbB and tonB mutants of E. coli 
EB53, and the ability of the strains to utilize hemin and hemoglobin as porphyrin and 
iron sources was assessed. In these assays, both mutants of E. coli EB53 were 
unable to use hemin either as a porphyrin source or as an iron source in the presence 
of a functional hmbR (Table 2). The usage of hemoglobin as an iron source was also 
affected (Table 2). These results are consistent with the notion that the hmbR gene 
product, the N. meningitidis hemoglobin receptor protein of the invention, is TonB- 
dependent, since expression of this gene in TonB wild type E. coli supported the use 
of hemin and hemoglobin as sole iron source in the experiments disclosed in 
Example 2. 
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EXAMPLE 7 

Functional Demonstration that the hmbR Gene Product is the 
Hemoglobi n Receptor Protein in N. meningitidis 

As shown in the data presented in Table II, hmbR mediated both hemin and 

5 hemoglobin utilization when expressed in E. coli, but hemoglobin utilization was less 

vigorous than hemin utilization. To determine if the HmbR receptor has the same 

specificity in TV. meningitidis, hmbR was inactivated with a 1.2kb kanamycin cassette 

(aphA-3; Nassif et al y 1991, ibid.) and transformed into wild-type N. meningitidis 

8013 clone 6 (serotype C) cells. The inactivation of the chromosomal hmbR copy 

10 of the Km-resistant transformants was confirmed by Southern hybridization, as 

shown in Figure 5. As can be seen from Figure 5, wild-type N. meningitidis 
genomic DNA contains only one copy of the hmbR gene (lanes 1 and 3). In the Km r 
transformants, the size of the DNA fragments containing the wild- type gene has 
increased by 1.2 kb, which is the size of the Kan cassette (Figure 5, lanes 2 and 4). 

15 When tested for its ability to utilize different iron-containing compounds, these 

mutant cells were found to be unable to use hemoglobin-bound iron, regardless of 
the source (human, bovine, baboon, mouse). The ability of the mutant to utilize 
hemoglobin-haptoglobin was not tested because the wild-type N. meningitidis strain 
is unable to use haptoglobin-haemoglobin complex as an iron source. However, the 

20 mutant was still able to use hemin iron, lactoferrin- and transferrin-bound iron as 

well as citrate-iron (Table II). As the iron-containing component of hemoglobin is 
hemin, a hemoglobin receptor would be expected to be capable of transporting hemin 
into the periplasm. Indeed, the cloning strategy disclosed herein depended on the 
ability of the cloned meningococcal receptor to transport hemin into the periplasm 

25 of E. coli. These results strongly suggest that N. meningitidis has at least two 

functional receptors that are involved in the internalization of hemin-containing 
compounds. One is the hemoglobin receptor described herein, which allows the 
utilization of both hemin and hemoglobin as iron sources. The other putative 
receptor in N. meningitidis is a hemin receptor which allows utilization of only 

30 hemin. This schema is also consistent with the isolation of several cosmid clones 
that allow E. coli EB53 to utilize hemin. DNAs from these cosmids do not hybridize 
with our hmbR probe , indicating that these clones encode a structurally-distinct 

-40- 

BNSDOCID:<WO 9612020A3_IA> 



WO 96/12020 




PCTYUS95/13623 



receptor protein capable of transporting hemin into the periplasm of N. meningitidis 
cells. 

EXAMPLE 8 

5 Attenuation of Virulence in hmbR Mutant N. meningitidis Cells In Vivo 

In order to test the importance of hemoglobin and hemin scavenging systems 
of N. meningitidis in vivo, the hmbR -mutant and the wild type strain of N. 
meningitidis , serotype C were inoculated into 5 day old infant rats and the numbers 
of bacteria recovered from blood and cerebrospinal fluid were followed. In these 

10 experiments, the method for the assessing N. meningitidis, serotype C virulence 
potential was essentially the same as described by Nassif et aL (1992, ibid.) using 
infant inbred Lewis rats (Charles River, Saint Aubin les Elbeufs, France). Inbred 
rats were used to minimize individual variations. Briefly, the 8013 strain was 
reactivated by 3 animal passages. After the third passage, bacteria were kept frozen 

15 in aliquots at -80° C. To avoid the possibility that modifications in the course of 

infection could result from selection of one spontaneous avirulent variant, one aliquot 
from the animal-passed frozen stock of 8013 was transformed with chromosomal 
DNA from the hmbR mutant, the resultant Kan r transformants were pooled without 
further purification and kept frozen at -80 °C. For each experiment, all infant rats 

20 were from the same litter. N. meningitidis 8013 was grown overnight and 2 X 10 6 
bacteria injected intraperitoneal^ into the infant rat. Three rats were used for each 
meningococcal strain. The course of infection was followed over a 24 hours time 
period with blood collected at the indicated times. At the 24 h time period, the rats 
were sacrificed, the cerebrospinal fluid (CSF) collected and the number of colony- 

25 forming units (CFU) determined. Each experiment was performed in replicate; 

similar results were obtained both times. 

The results of these experiments are shown in Figure 6. The hmbR * strain, 
which is unable to use hemoglobin as an iron source, was recovered from the blood 
of infected animals in significantly lower numbers when compared with the wild type 

30 strain. Both the mutant and the wild type strain were still able to cross the blood- 

brain barrier as indicated by the isolation of bacteria from the cerebrospinal fluid. 
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These results indicate that hemoglobin represents an important iron source for N. 
meningitidis during growth in vivo. 

EXAMPLE 9 

Polymerase Chain Reaction Amplification of Hemoglobin Receptor 
Genes from N. meningitidis S erotypes and N. gonorrhoeae 

From the nucleotide sequence of the 3.3 kb BamHl-Hindni DNA fragment 
carrying the hmbR gene and its promoter region was determined specific 
oligonucleotide promers for in vitro amplification of the homologous hemoglobin 
receptor protein genes from N. meningitidis serotypes A and B and AT. gonorrhoeae 
MSI 1 A as follows. 

The following oligonucleotide primers were developed for in vitro 
amplificaiton reactions using the polymerase chain reaction (PCR; Saiki et aL , 1988, 
Science 230: 1350-1354): 

5 '-AAACAGGTCTCGGCATAG-3 ' (sense primer) (SEQ ID No.: 11) 

5 '-CGCGAATTCAAACAGGTCTCGGCATAG-3 ' (SEQ ID No. : 12) 

(antisense primer) 

for amplifying the hemoglobin receptor protein from N. meningitidis, serotype A; 

5 '-CGCGAATTCAAAAACTTCCATTCCAGCGATACG-3 ' (SEQ ID No. : 13) 
(sense primer) 

5 '-TAAAACTTCCATTCC AGCGATACG-3 ' (antisense primer) (SEQ ID No.: 14) 
for amplifying the hemoglobin receptor protein from N. meningitidis, serotype B; 
5 '-AAACAGGTCTCGGCATAG-3 ' (sense primer) (SEQ ID No. : 15) 

or 

5 '-CGCGAATTCAAACAGGTCTCGGCATAG-3 ' (SEQ ID No. : 16) 

(sense primer) 

and 

5 '-CGCGAATTCAAAAACTTCCATTCC AGCGATACG-3' (SEQ ID No. : 17) 
(antisense primer) 

or 

5 '-TAAAACTTCCATTCC AGCGATACG-3 ' (antisense primer) (SEQ ID No.: 18) 
for amplifying the hemoglobin receptor protein from N. gonorrhoeae MS11A. 

Genomic DNA from N. meningitidis serotype A or B or N. gonorrhoeae 
species was prepared using standard techniques (see Sambrook, et aL, ibid,), 
including enzymatic degradation of bacterial cell walls, protoplast lysis, protease and 
RNase digestion, extraction with organic solvents such as phenol and/or chloroform, 
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and ethanol precipitation. Crude DNA preparations were also used. An amount 
(typically, about 0.1 fig) of genomic DNA was used for each amplification reaction. 
A PGR amplification reaction consisted of Pfu polymerase (Stratagene, LaJolIa, CA) 
and/or Taq polymerase (Boehringer Mannheim, Germany) in the appropriate buffer 
including about 20picomoles of each amplificaiton primer and 200nanomoles of each 
deoxynucleoside triphosphate. Amplification reactions were performed according to 
the following scheme: 



First cycle 5 min at 95 °C 

2 min at 51°C 
6 min at 72°C 

Cycles 2-13 45 sec at 95 °C 

35 sec at 49 °C 
10 min at 72 °C 



Cycles 14-30 25 sec at 95 °C 

35 sec at 47°C 
10 min at 72°C 

Upon completion of the amplification reaction, DNA fragments were cloned either 
blunt-ended or, after EcoRI digestion, into EcoRI digested pSUKS or pWKS30 
vectors and transformed into bacteria. Positively-selected clones were then analyzed 
for the presence of recombinant inserts, which were sequenced as described above 
in Example 4. 

As a result of these experiments, three clones encoding the hemoglobin 
receptor genes from N. meningitidis serotypes A and B and N. gonorrhoeae MSI 1 A 
were cloned and the sequence of these genes determined. The nucleic acid sequence 
for each of these genes are shown in Figures 7 (N. meningitidis, serotype A), 8 (N. 
meningitidis, serotype A) and 9 (N. gonorrhoeae MS11A). 

The degree of homology between the cloned hemoglobin receptors from the 
different N. meningitidis serotypes and N. gonorrhoeae MS11A was assessed by 
nucleic acid and amino acid sequence comparison, as described in Example 5 above. 
The results of these comparisons are shown in Figures 10 and 11, respectively. 
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Hemoglobin receptor genes from the three TV. meningitidis serotypes and AT. 
gonorrhoeae MS11A were found to be from 86.5% to 93.4% homologous; the most 
homologous nucleic acids were TV. meningitidis serotypes B and C, and the most 
divergent nucleic acids were TV. meningitidis serotype B and TV. gonorrhoeae MSI 1A 
5 (Figure 10 and Table III). Homoglobin receptor proteins from all four Neisseria 

species showed a high degree of homology to the other members of the group, 
ranging from 87% homology between the hemoglobin receptor proteins from TV. 
gonorrhoeae MS11A and TV. meningitidis serotype B to 93% homology between 
hemoglobin receptor proteins from TV. meningitidis serotypes A and B (Figure 11). 

10 In this comparison, all four receptors were found to share 84.7% amino acid 

sequence identity, and up to 11.6% sequence similarity (i.e., chemically-related 
amino acid residues at homologous sites within the amino acid sequence). The non- 
conserved amino acids were found clustered in the regions of the amino acid 
sequence corresponding to the external loops in the predicted topographical structure 

15 of the hemoglobin receptor proteins. 

It should be understood that the foregoing disclosure emphasizes certain 
specific embodiments of the invention and that all modifications or alternatives 
equivalent thereto are within the spirit and scope of the invention as set forth in the 
appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Oregon Health Sciences University 

(B) STREET; 3181 S.W. Sam Jackson Park Road 

(C) CITY: Portland 

( D ) STATE : Oregon 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 97201-3098 

(G) TELEPHONE: S03-494-8200 

(H) TELEFAX: (503 ) -494-4729 

(ii) TITLE OF INVENTION: A Novel Bacterial Hemoglobin Receptor 
and Uses 

(iii) NUMBER OF SEQUENCES: 18 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER : PCT/US95/ 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..2373 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG AAA CCA TTA CAA ATG CTC CCT ATC GCC GCG CTG GTC GGC AGT ATT 4 8 

Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
15 10 15 
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TTC GGC AAT CCG GTC TTG GCA GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG ATA AAA GCA GTG CGC GTT AAA GGT CAG CGC AAT 144 
Pro Val Lys Ala Glu He Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAT AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 240 
Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 288 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGT GTA AAC CTG CCT GAT 3 36 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 384 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT TTG TCT ATC GAC CCC GAA CTC GTA CGC AAT ATT GAA ATC GTG AAG 432 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Val Lys 
130 135 140 

GGC GCA GAC TCT TTC AAT ACC GGC AGT GGT GCA TTG GGC GGC GGT GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACG CTG CAA GGC CGT GAT TTG CTG TTG GAC GAC AGG CAA 528 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGT GTG AGT AAC GAC CGC GTG GAT GCT GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACC GAA AGC GCG GGC AAC 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Asn 
210 215 220 

CGC GGC TAT CCG GTA GAA GGT GCG GGT AAA GAA ACG AAT ATC CGC GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Lys Glu Thr Asn He Arg Gly 
225 230 235 240 

TCC GCC CGC GGC ATC CCC GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 768 
Ser Ala Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAT 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 
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AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AG A CGG 912 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ATG CCT GAT TCA AAT TGG TTG 960 
Arg Asn Ala Asn Leu Phe Tyr Glu Trp Met Pro Asp Ser Asn Trp Leu 
305 310 315 320 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG AAA ACC AAA GTG GCG GCG 1008 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Lys Thr Lys Val Ala Ala 
325 330 335 

ATT AAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACT GAG 1056 
lie Asn Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr Glu 
340 345 350 

TAC CAT AAA AAG GAA GTT GGC GAA ATA TAC AAC CGC AGC ATG GAC ACC 1104 
Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp Thr 
355 360 365 

CGA TTC AAA CGT TTT ACT TTG CGT TTG GAC AGC CAT CCG TTG CAA CTC 1152 
Arg Phe Lys Arg Phe Thr Leu Arg Leu Asp Ser His Pro Leu Gin Leu 

370 375 380 

GGG GGG GGG CGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC CGC CGT 1200 
Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser Arg Arg 
3B5 390 395 400 

GAT TTT GAA AAC CTA AAC CGC GAC GAT TAT TAC TTC AGC GGC CGT GTT 1248 
Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg Val 
405 410 415 

GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC AAC TAC 1296 
Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAC CAA ATT CAA TGG AAC GAC GTG TTC AGT AGC 1344 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGC GCA GGT ATC CGT TAC GAC CAC ACC AAA ATG ACG CCT CAG GAA TTG 13 92 

Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCA CCT GCA GCC AAC ACT 144 0 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG AAT CAG 14 88 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 1536 
Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT TGG CTG 1584 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 1632 
Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGC ATG CTG GAT GCC AAC CTG TAT CAA 1680 
Leu Gin Gly Arg Ser Glu Lys Gly Met Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 
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AGC 


AAT 


TAC 


CGC 


AAT 


TTC 


Liu 






uAu 


CAu 




CTG 




ALL 




J. / <t O 


Ser 


Asn 


Tyr 


Arg 


Asn 


Phe 


Leu 


Ser 


Glu 


Glu 


Gin 


Lys 


Leu 


Thr 


Thr 


Ser 












565 










570 










575 






GGC 


ACT 


CCC 


GGC 


TGT 


ACT 


GAG 


GAA 


AAT 


GCT 


TAC 


TAC 


AGT 


ATA 


TGC 


AGC 


1776 


Gly 


Thr 


Pro 


Gly Cys 


Thr 


Glu 


Glu 


Asn 


Ala 


Tyr 


Tyr 


Ser 


He 


Cys 


Ser 








580 










585 










590 








GAC 


CCC 


TAC 


AAA 


GAA 


AAA 


CTG 


GAT 


TGG 


CAG 


ATG 


AAA 


AAT 


ATC 


GAC 


AAG 


1624 


Asp 


Pro 


Tyr 


Lys 


Glu 


Lys 


Leu 


Asp 


Trp 


Gin 


Met 


Lys 


Asn 


He 


Asp 


Lys 








595 










600 










605 











GCC 
Ala 


AGA 
Arg 
610 


ATC 
He 


CGC GGT 
Arg Gly 


ATC 
He 


GAG 
Glu 
615 


CTG 
Leu 


ACA GGC CGT 
Thr Gly Arg 


CTG 
Leu 
620 


AAT 
Asn 


GTG 
Val 


GAC 
Asp 


AAA 

Lys 


1872 


GTA 
Val 
625 


GCG 
Ala 


TCT 
Ser 


TTT 
Phe 


GTT 
Val 


CCT 
Pro 
630 


GAG 
Glu 


GGC 
Gly 


TGG 
Trp 


AAA 
Lys 


CTG 
Leu 
635 


TTC 
Phe 


GGC 
Gly 


TCG 
Ser 


CTG 
Leu 


GGT 
Gly 
640 


1920 


TAT 
Tyr 


GCG 
Ala 


AAA 

Lys 


AGC 
Ser 


AAA 
Lys 
645 


CTG 
Leu 


TCG 
Ser 


GGC 
Gly 


GAC 
Asp 


AAC 
Asn 
650 


AGC 
Ser 


CTG 
Leu 


CTG 
Leu 


TCC 
Ser 


ACA 
Thr 
655 


CAG 
Gin 


1968 


CCG 
Pro 


CTG 
Leu 


AAA 
Lys 


GTG 
Val 
660 


ATT 
He 


GCC 
Ala 


GGT 
Gly 


ATC 
He 


GAC 
Asp 
665 


TAT 
Tyr 


GAA 

Glu 


AGT 
Ser 


CCG 
Pro 


AGC 
Ser 
670 


GAA 
Glu 


AAA 

Lys 


2016 


TGG 
Trp 


GGC 
Gly 


GTA 
Val 
675 


TTC 
Phe 


TCC 
Ser 


CGC 
Arg 


CTG 
Leu 


ACC 
Thr 
680 


TAT 
Tyr 


CTG 
Leu 


GGC 
Gly 


GCG 
Ala 


AAA 
Lys 
685 


AAG 
Lys 


GTC 
Val 


AAA 

Lys 


2064 



GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG CCT TTG 2112 

Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro Leu 
690 695 700 

e 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 2160 

Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACC CTG CGT 2208 

Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr Leu Arg 

725 730 735 



GCG GGC GTG TAC AAC CTG 
Ala Gly Val Tyr Asn Leu 
74 0 

CTG CGC GGT TTA TAT AGC 
Leu Arg Gly Leu Tyr Ser 
755 

GGC AAA GGC TTA GAT CGC 
Gly Lys Gly Leu Asp Arg 
770 

TCG CTG GAA TGG AAG TTT 
Ser Leu Glu Trp Lys Phe 
785 790 



TTC AAC CGC AAA TAC ACC 
Phe Asn Arg Lys Tyr Thr 
745 

TAC AGC ACC ACC AAT GCG 
Tyr Ser Thr Thr Asn Ala 
760 

TAC CGC GCC CCA GGC CGC 
Tyr Arg Ala Pro Gly Arg 
775 780 

TAA 



ACT TGG GAT TCC 2256 

Thr Trp Asp Ser 
750 

GTC GAC CGC GAT 2304 

Val Asp Arg Asp 

765 

AAT TAC GCC GTA 23 52 

Asn Tyr Ala Val 



2373 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
1 5 10 15 

Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu He Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Val Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

9 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Asn 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Lys Glu Thr Asn He Arg Gly 
225 230 235 240 

Ser Ala Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Ala Asn Leu Phe Tyr Glu Trp Met Pro Asp Ser Asn Trp Leu 
305 310 315 320 
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Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Lys Thr Lys Val Ala Ala 
325 330 335 

lie Asn Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr Glu 
340 345 350 

Tyr His Lys Lys Glu Val Gly Glu lie Tyr Asn Arg Ser Met Asp Thr 
355 360 365 

Arg Phe Lys Arg Phe Thr Leu Arg Leu Asp Ser His Pro Leu Gin Leu 
370 375 380 

Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg Val 
405 410 415 

Val Arg Thr Thr Ser Ser lie Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

Arg Ala Gly lie Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

Ala Trp Arg Val Gly Tyr Asp lie Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

Leu Gin Gly Arg Ser Glu Lys Gly Met Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

Gly Thr Pro Gly Cys Thr Glu Glu Asn Ala Tyr Tyr Ser lie Cys Ser 
580 585 590 

Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn lie Asp Lys 
595 600 605 

Ala Arg lie Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val Asp Lys 
610 615 620 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
625 630 635 640 

Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

Pro Leu Lys Val lie Ala Gly lie Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 
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Trp Gly Val Phe 
675 



Asp Ala Gin Tyr 
690 

Gin Lys Lys Val 
705 

Phe Asp Met Tyr 



Ala Gly Val Tyr 
740 

Leu Arg Gly Leu 
755 

Gly Lys Gly Leu 
770 

Ser Leu Glu Trp 
785 



Ser Arg Leu Thr 
680 

Thr Val Tyr Glu 
695 

Lys Asp Tyr Pro 
710 

Gly Phe Tyr Lys 
725 

Asn Leu Phe Asn 



Tyr Ser Tyr Ser 
760 



Asp Arg Tyr Arg 
775 

Lys Phe 
790 



Tyr Leu Gly Ala 



Asn Lys Gly Trp 
700 

Trp Leu Asn Lys 
715 

Pro Val Lys Asn 
730 

Arg Lys Tyr Thr 
745 

Thr Thr Asn Ala 



Ala Pro Gly Arg 
780 



Lys Lys Val Lys 
685 

Gly Thr Pro Leu 



Ser Ala Tyr Val 
720 

Leu Thr Leu Arg 
735 

Thr Trp Asp Ser 
750 

Val Asp Arg Asp 

765 



Asn Tyr Ala Val 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2375 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..2375 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATG AAA CCA TTA CAA ATG CCC CCT ATC GCC GCG CTG CTC GGC AGT ATT 4 8 

Met Lys Pro Leu Gin Met Pro Pro lie Ala Ala Leu Leu Gly Ser lie 
1 5 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGT CAG CGC AAT 144 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Ash Leu Asn Arg lie Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAT AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 24 0 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGG AGC CGT CAT CAA AAA GGC TTT GCC ATT CGC GGC GTG 288 
Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala lie Arg Gly Val 
85 90 95 
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GAA GGC GAC CGT GTC GGC GTT AGT ATT GAC GGC GTA AAC CTG CCT GAT 336 
Glu Gly Asp Arg Val Gly Val Ser lie Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 364 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 432 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp lie Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC GAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 528 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 768 
Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 960 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACC 1056 
Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 350 

GAG TAC CAT AAA AAG GAA GTT GGC GAA ATC TAT AAC CGC AGC ATG GAT 1104 
Glu Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp 
355 360 365 
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ACA ACC TTC AAA CGT ATT ACG CTG CGT 
Thr Thr Phe Lys Arg lie Thr Leu Arg 
370 375 

CTC GGG GGG GGG CGA CAC CGC CTG TCG 
Leu Gly Gly Gly Arg His Arg Leu Ser 
385 390 

CGT GAT TTT GAA AAC TTA AAC CGC GAC 
Arg Asp Phe Glu Asn Leu Asn Arg Asp 
405 

GTT GTT CGA ACC ACC AAC AGT ATC CAG 
Val Val Arg Thr Thr Asn Ser lie Gin 
420 425 



ATG GAC AGC CAT CCG TTG CAA 1152 
Met Asp Ser His Pro Leu Gin 
380 

TTC AAA ACC TTT GCC GGG CAG 12 00 

Phe Lys Thr Phe Ala Gly Gin 
395 400 

GAT TAC TAC TTC AGC GGC CGT 1248 
Asp Tyr Tyr Phe Ser Gly Arg 
410 415 

CAT CCG GTG AAA ACC ACC AAC 12 96 

His Pro Val Lys Thr Thr Asn 
430 



TAC GGT TTC TCG CTG TCC GAC CAA ATC CAA TGG AAC GAC GTG TTC AGT 1344 
Tyr Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Phe Ser 
435 440 445 

AGC CGC GCA GGT ATC CGT TAC GAC CAC ACC AAA ATG ACG CCT CAG GAA 13 92 

Ser Arg Ala Gly lie Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu 
450 455 460 

TTG AAT GCC GAC TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAC 144 0 

Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn 
465 470 475 480 

ACT TAT AAA GGC TGG AGC GGA TTT GTC GGC TTG GCG GCG CAG CTG AGC 1488 
Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Ser 
485 490 495 

CAA ACA TGG CGT TTG GGT TAC GAT GTG ACC TCA GGT TTC CGC GTG CCG 1536 

Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arg Val Pro 
500 505 510 

AAT GCG TCT GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGC ACT TGG 1584 
Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Thr Trp 
515 520 525 

AAG CCT AAT CCT AAT TTG AAG GCA GAA CGC AGC ACC ACC CAC ACC CTG 16 3 2 

Lys Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu 
530 535 540 

TCC TTG CAG GGG CGC GGC GAC AAA GGG ACA CTG GAT GCC AAC CTG TAT 1680 
Ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Ala Asn Leu Tyr 
545 550 555 560 

CAA AGC AAT TAC CGA AAC TTC CTG TCG GAA GAG CAG AAT CTG ACT GTC 1728 
Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 
565 570 575 

AGC GGC ACA CCC GGC TGT ACT GAG GAG GAT GCT TAC TAC TAT AGA TGC 17 76 

Ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr Arg Cys 
580 585 590 

AGC GAC CCC TAC AAA GAA AAA CTG GAT TGG CAG ATG AAA AAT ATC GAC 1824 
Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn lie Asp 
595 600 605 

AAG GCC AGA ATC CGC GGT ATC GAG TTG ACA GGC CGT CTG AAT GTG GAC 1872 
Lys Ala Arg lie Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp 
610 615 620 
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AAA GTA GCG TCT TTT GTT CCT GAG GGT 
Lys Val Ala Ser Phe Val Pro Glu Gly 
€25 630 

GGT TAT GCG AAA AGC AAA CTG TCG GGC 
Gly Tyr Ala Lys Ser Lys Leu Ser Gly 
645 

CAG CCG CTG AAA GTG ATT GCC GGT ATC 
Gin Pro Leu Lys Val lie Ala Gly lie 
660 665 



TGG AAA CTG TTC GGC TCG CTG 192 0 

Trp Lys Leu Phe Gly Ser Leu 
635 640 

GAC AAC AGC CTG CTG TCC ACA 1968 
Asp Asn Ser Leu Leu Ser Thr 
650 655 

GAC TAT GAA AGT CCG AGC GAA 2016 
Asp Tyr Glu Ser Pro Ser Glu 
670 



AAA TGG GGC GTA TTC TCC CGC CTG ACC TAT CTA GGC GCG AAA AAG GTC 2064 
Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val 
675 680 685 

AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG CCT 2112 
Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 
690 695 700 

TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT 2160 
Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 
705 710 715 720 

GTG TTT GAT ATG TAC GGC TTC TAC AAA CCG GCT AAA AAC CTG ACT TTG 2208 
Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 
725 730 735 

CGT GCA GGC GTG TAC AAC CTG TTC AAC CGC AAA TAC ACC ACT TGG GAT 2256 
Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp 
740 745 750 

TCC CTG CGC GGT TTA TAT AGC TAC AGC ACC ACC AAT GCG GTC GAC CGC 2304 
Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg 
755 760 765 

GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA GGC CGC AAT TAC GCC 2352 
Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala 
770 775 780 

GTA TCG CTG GAA TGG AAG TTT TAA 2375 
Val Ser Leu Glu Trp Lys Phe * 
785 790 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Pro Leu Gin Met Pro Pro lie Ala Ala Leu Leu Gly Ser lie 
15 10 15 

Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 
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Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala lie Arg Gly Val 
85 90 95 

Glu Gly Asp Arg Val Gly Val Ser lie Asp Gly Val Asn Leu Pro Asp 
100 105 no 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Asp lie Val Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn lie Arg Gly 
225 230 235 240 

Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 350 

Glu Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp 
355 360 365 

Thr Thr Phe Lys Arg He Thr Leu Arg Met Asp Ser His Pro Leu Gin 
370 375 380 

Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Gly Gin 
385 390 395 400 



- 56 - 



BNSDOCID: <WO 9612020A3_IA> 



WO 96/12020 




PCT/US95/13623 



Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg 
405 410 415 

Val Val Arg Thr Thr Asn Ser lie Gin His Pro Val Lys Thr Thr Asn 
420 425 430 

Tyr Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Phe Ser 
435 440 445 

Ser Arg Ala Gly lie Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu 
450 455 460 

Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn 
465 470 475 480 

Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Ser 
485 490 495 

Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arg Val Pro 
500 505 510 

Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Thr Trp 
515 520 525 

Lys Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu 
530 535 540 

Ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Ala Asn Leu Tyr 
545 550 555 560 

Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 
565 570 575 

Ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr Arg Cys 
580 585 590 

Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn lie Asp 
595 600 605 

Lys Ala Arg lie Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val Asp 
610 615 620 

Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu 
625 630 635 640 

Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr 
645 650 655 

Gin Pro Leu Lys Val lie Ala Gly lie Asp Tyr Glu Ser Pro Ser Glu 
660 665 670 

Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val 
675 680 685 

Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 
690 695 700 

Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 
705 710 715 720 

Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 
725 730 735 

Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp 
740 745 750 
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Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg 
755 760 765 

Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala 
770 775 780 

Val Ser Leu Glu Trp Lys Phe 
785 790 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2379 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

<A) NAME /KEY : CDS 

(B) LOCATION: 1..2379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG AAA CCA TTA CAA ATG CTC CCT ATC GCC GCG CTG GTC GGC AGT ATT 48 
Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
1 5 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGC CAG CGC AAT 144 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 24 0 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 288 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 

85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGC GTA AAC CTG CCT GAT 336 
Glu Gly Asn Arg Val Gly Val Ser lie Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 3 84 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 432 
Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Asp lie Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 
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AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 528 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn lie Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 768 
Ser Ala Arg Gly lie Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys lie Ala Tyr Gin lie Asn Asp Asn His Arg lie Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 960 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ATA GAG GAT TCT TCC ACC TTG ACA 1056 
Val Asn Tyr Lys Gly Ser Phe Pro lie Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

CGT AAC TAC AAT CAA AAG GAC TTG GAT GAA ATC TAC AAC CGC AGT ATG 1104 
Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu lie Tyr Asn Arg Ser Met 
355 360 365 

GAT ACC CGC TTC AAA CGC ATT ACC CTG CGT TTG GAC AGC CAT CCG TTG 1152 
Asp Thr Arg Phe Lys Arg lie Thr Leu Arg Leu Asp Ser His Pro Leu 
370 375 380 

CAA CTC GGG GGG GGG CGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC 1200 
Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
385 390 395 400 

CGC CGT GAT TTT GAA AAC CTA AAC CGC GAC GAT TAT TAC TTC AGC GGC 124 8 

Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly 
405 410 415 

CGT GTT GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC 1296 
Arg Val Val Arg Thr Thr Ser Ser lie Gin His Pro Val Lys Thr Thr 
420 425 430 
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AAC TAC GGT TTC TCA CTG TCT GAC CAA ATT CAA TGG AAC GAC GTG TTC 1344 
Asn Tyr Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Phe 
435 440 445 

AGT AGC CGC GCA GGT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG 13 92 

Ser Ser Arg Ala Gly lie Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
450 455 460 

GAA TTG AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC 144 0 

Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
465 470 475 480 

AAC ACT TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG 14 88 

Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
485 490 495 

AAT CAG GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC 1536 
Asn Gin Ala Trp Arg Val Gly Tyr Asp lie Thr Ser Gly Tyr Arg Val 
500 505 510 

CCC AAT GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT 1584 
Pro Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn 
515 520 525 

TGG CTG CCC AAT CCC AAC CTG AAA GCC GAG CGC ACG ACC ACC CAC ACC 1632 
Trp Leu Pro Asn Pro Asn Leu Lys Ala Glu Arg Thr Thr Thr His Thr 
530 535 540 

CTC TCT CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG 1680 
Leu Ser Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu 
545 550 555 560 

TAT CAA AGC AAT TAC CGC AAT TTC CTG TCT GAA GAG CAG AAG CTG ACC 1728 
Tyr Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr 
565 570 575 

ACC AGC GGC GAT GTC AGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG 1776 
Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 585 590 

TGT AGC AAT CCT TAT TCC GAA AAA CTG GAA TGG CAG ATG CAA AAT ATC 1824 
Cys Ser Asn Pro Tyr Ser Glu Lys Leu Glu Trp Gin Met Gin Asn lie 
595 600 605 

GAC AAG GCC AGA ATC CGC GGT ATC GAG CTG ACG GGC CGT CTG AAT GTG 1872 
Asp Lys Ala Arg lie Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val 
610 615 620 

GAC AAA GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA CTG TTC GGC TCG 1920 
Asp Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser 
625 630 635 640 

CTG GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC 196 8 

Leu Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser 
645 650 655 

ACC CAG CCG TTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC 2016 
Thr Gin Pro Leu Lys Val lie Ala Gly lie Asp Tyr Glu Ser Pro Ser 
660 665 670 

GAA AAA TGG GGC GTG TTC TCC CGC CTG ACC TAT CTG GGC GCG AAA AAG 2064 
Glu Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys 
675 680 685 

GTC AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG 2112 
Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 
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CCT TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT 216 0 

Pro Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala 
705 710 715 720 

TAT GTG TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACT 2208 
Tyr Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr 
725 730 735 

TTG CGT GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG 2256 
Leu Arg Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp 
740 745 750 

GAT TCC CTG CGC GGC CTG TAT AGC TAC AGC ACC ACC AAC TCG GTC GAC 2304 
Asp Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ser Val Asp 
755 760 76S 

CGC GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA AGC CGT AAT TAC 2352 
Arg Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Ser Arg Asn Tyr 
770 775 780 

GCC GTA TCG CTG GAA TGG AAG TTT TAA 2379 
Ala Val Ser Leu Glu Trp Lys Phe * 
785 790 



(2) INFORMATION FOR SEQ ID NO : 6 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 792 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
1 5 10 .15 

Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser lie Asp Gly Val Asn Leu Pro Asp 
100 105 110 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Asp lie Val Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 
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Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn lie Arg Gly 
225 230 235 240 

Ser Ala Arg Gly lie Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

Leu Gly Lys lie Ala Tyr Gin lie Asn Asp Asn His Arg lie Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

Val Asn Tyr Lys Gly Ser Phe Pro lie Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu lie Tyr Asn Arg Ser Met 
355 360 365 

Asp Thr Arg Phe Lys Arg lie Thr Leu Arg Leu Asp Ser His Pro Leu 
370 375 380 

Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
385 390 395 400 

Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly 
405 410 415 

Arg Val Val Arg Thr Thr Ser Ser lie Gin His Pro Val Lys Thr Thr 
420 425 430 

Asn Tyr Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Phe 
435 440 445 

Ser Ser Arg Ala Gly lie Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
450 455 460 

Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
465 470 475 480 

Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
485 490 495 

Asn Gin Ala Trp Arg Val Gly Tyr Asp lie Thr Ser Gly Tyr Arg Val 
500 505 510 
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Pro Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn 
515 520 525 

Trp Leu Pro Asn Pro Asn Leu Lys Ala Glu Arg Thr Thr Thr His Thr 
530 535 540 

Leu Ser Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu 
545 550 555 560 

Tyr Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr 
565 570 575 

Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 565 590 

Cys Ser Asn Pro Tyr Ser Glu Lys Leu Glu Trp Gin Met Gin Asn lie 
595 600 605 

Asp Lys Ala Arg lie Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val 
610 615 620 

Asp Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser 
625 630 635 640 

Leu Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser 
645 650 655 

Thr Gin Pro Leu Lys Val lie Ala Gly lie Asp Tyr Glu Ser Pro Ser 
660 665 670 

Glu Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys 
675 680 685 

Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 

Pro Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala 
705 710 715 720 

Tyr Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr 
725 730 735 

Leu Arg Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp 
740 745 750 

Asp Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ser Val Asp 
755 760 765 

Arg Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Ser Arg Asn Tyr 
770 775 780 

Ala Val Ser Leu Glu Trp Lys Phe 
785 790 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2378 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 1..2373 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 7 : 

ATG AAA CCA TTA CAC ATG CTT CCT ATT GCC GCG CTG GTC GGC AGT ATT 4 8 

Met Lys Pro Leu His Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
15 10 is 

TTC GGC AAT CCG GTC TTG GCA GCG GAT GAA GCT GCA ACC GAA ACC ACA 96 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAA GCA GAG ATA AAA GAA GTG CGC GTT AAA GAC GAG CTT AAT 144 
Pro Val Lys Ala Glu lie Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

GCG CCT GCA ACC GTG GAA CGT GTC AAC CTC GGC CGC ATT CAA CAG GAA 192 
Ala Pro Ala Thr Val Glu Arg Val Asn Leu Gly Arg lie Gin Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGT TAC TCC ACC GAC GTC GGC 24 0 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAT AGC GGC CGC CAT CAA AAA GGC TTT GCT GTG CGC GGC GTG 288 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGT GTC AGC ATT GAC GGC GTG AGC CTG CCT GAT 336 
Glu Gly Asn Arg Val Gly Val Ser lie Asp Gly Val Ser Leu Pro Asp 
100 105 110 

TCG GAA GAA AAC TCA CTG TAT GCA CGT TAT GGC AAC TTC AAC AGC TCG 3 84 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGC CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAA ATC GCG AAG 432 
Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Glu lie Ala Lys 
130 135 140 

GGC GCT GAC TCT TTC AAT ACC GGT AGC GGC GCA TTG GGT GGC GGC GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CAT GAT TTG CTG TTG GAC GAC AGG CAA 528 
Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC AGC CGC AAC CGC GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACA CTC GGT TTC GGT GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGT CGC GGT CAT GAG ACC GAA AGC GCG GGC GAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

CGT GGC TAT CCG GTA GAG GGT GCT GGC AGC GGA GCA ATT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala lie lie Arg Gly 
225 230 235 240 

TCG TCA CGC GGT ATC CCT GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 768 
Ser Ser Arg Gly lie Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 
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TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAG CAC CGC ATC GGC CCA 816 
Leu Gly Lys lie Ala Tyr Gin lie Asn Asp Lys His Arg lie Gly Pro 
260 265 270 

TCG TTT AAC GGC CAG CAG GGG CAT AAT TAC ACG ATT GAA GAG TCT TAT 864 
Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr lie Glu Glu Ser Tyr 
275 280 285 

AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AGA CGG 912 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ACG CCT GAT TCA AAT TGG CTG 960 
Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
305 310 315 320 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG ACA ACC AAA GTG GCG GCG 1008 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 335 

GTT AAC AAC AAA GGC TCG TTC CCG ACG GAT TAT TCC ACC TGG ACG CGC 1056 
Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Arg 
340 345 350 

AAC TAT AAT CAG AAG GAT TTG GAG AAT ATA TAC AAC CGC AGC ATG GAC 1104 
Asn Tyr Asn Gin Lys Asp Leu Glu Asn lie Tyr Asn Arg Ser Met Asp 
355 360 36S 

ACC CGA TTC AAA CGT TTT ACT TTG CGT ATG GAC AGC CAA CCG TTG CAA 1152 
Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 380 

CTG GGC GGC CAA CAT CGC TTG TCG CTT AAA ACT TTC GCC AGT CGG CGT 1200 
Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

GAG TTT GAA AAC TTA AAC CGC GAC GAT TAT TAC TTC AGC GAA AGA GTA 124 8 

Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 415 

TCC CGT ACT ACC AGC TCG ATT CAA CAC CCC GTG AAA ACC ACT AAT TAT 1296 
Ser Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAT CAA ATC CAA TGG AAC GAC GTG TTC AGC AGC 1344 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGT GCA GAT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG GAA TTG 1392 
Arg Ala Asp He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAT ACT 1440 
Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGA TTT GTC GGT TTG GCG GCG CAA CTG AAT CAG 1488 
Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CAT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 1536 
Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT TGG CTG 1584 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 
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CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 1632 
Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG TAT CAA 168 0 

Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

AAC AAT TAC CGC AAC TTC TTG TCT GAA GAG CAG AAG CTG ACC ACC AGC 1728 
Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

GGC GAT GTC GGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG TGT AGC 17 76 

Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cys Ser 
580 585 590 

AAT CCT TAT TCC GAA AAA CCG GAA TGG CAG ATG CAA AAT ATC GAT AAG 1824 
Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
595 600 605 

GCC CGA ATC CGT GGT CTT GAG CTG ACA GGC CGT CTG AAT GTG ACA AAA 18 72 

Ala Arg He Arg Gly Leu Glu Leu Thr Gly Arg Leu Asn Val Thr Lys 
610 615 620 

GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA TTG TTC GGC TCG CTG GGT 1920 
Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
625 630 635 640 

TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA CAG 1968 
Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

CCG CCG AAA GTG ATT GCC GGT GTC GAC TAC GAA AGC CCG AGC GAA AAA 2016 
Pro Pro Lys Val He Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

TGG GGT GTG TTC TCC CGC CTG ACT TAT CTG GGT GCG AAA AAG GCC AAA 2 064 

Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lys 
675 680 685 

GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC CGG GGT ACG CCT TTG 2112 
Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Arg Gly Thr Pro Leu 
690 695 700 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 2160 
Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 

705 710 715 720 

TTT GAT ATG TAC GGC TTC TAC AAA CTG GCT AAA AAC CTG ACT TTG CGT 2208 
Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arg 
725 730 735 

GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG GAT TCC 22 56 

Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

CTG CGC GGT TTG TAT AGC TAC AGC ACC . ACC AAC GCG GTC GAC CGA GAT 2304 
Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 

GGC AAA GGC TTA GAC CGC TAC CGC GCC TCA GGC CGT AAT TAC GCC GTA 2 3 52 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 
770 775 780 

TCG CTG GAT TGG AAG TTT TGA ATTCC 23 78 

Ser Leu Asp Trp Lys Phe * 
785 790 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys Pro Leu His Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
15 10 15 

Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu lie Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

Ala Pro Ala Thr Val Glu Arg Val Asn Leu Gly Arg lie Gin Gin Glu 
50 55 60 

Met lie Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser lie Asp Gly Val Ser Leu Pro Asp 
100 105 110 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser lie Asp Pro Glu Leu Val Arg Asn lie Glu lie Ala Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala lie lie Arg Gly 
225 230 235 240 

Ser Ser Arg Gly lie Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

Leu Gly Lys lie Ala Tyr Gin lie Asn Asp Lys His Arg lie Gly Pro 
260 265 270 

Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr lie Glu Glu Ser Tyr 
275 280 285 

Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 
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Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
305 310 315 320 

Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 335 

Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Arg 
340 345 350 

Asn Tyr Asn Gin Lys Asp Leu Glu Asn lie Tyr Asn Arg Ser Met Asp 
355 360 365 

Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 380 

Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 415 

Ser Arg Thr Thr Ser Ser lie Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

Gly Phe Ser Leu Ser Asp Gin lie Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

Arg Ala Asp lie Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cys Ser 
580 585 590 

Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
595 600 605 

Ala Arg He Arg Gly Leu Glu Leu Thr Gly Arg Leu Asn Val Thr Lys 
610 615 620 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
625 630 635 640 

Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 
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Pro Pro Lys Val lie Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lys 
675 680 685 

Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Arg Gly Thr Pro Leu 
690 695 700 

Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arg 
725 730 735 

Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 

770 775 780 

Ser Leu Asp Trp Lys Phe 
785 790 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 amino acids 

(B) TYPE: amino acid 
(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Gin Gin Gin His Leu Phe Arg Leu Asn lie Leu Cys Leu Ser Leu 
1 5 10 15 

Met Thr Ala Leu Pro Val Tyr Ala Glu Asn Val Gin Ala Glu Gin Ala 
20 25 30 

Gin Glu Lys Gin Leu Asp Thr lie Val Lys Ala Lys Lys Gin Lys Thr 
35 40 45 

Arg Arg Asp Asn Glu Val Thr Gly Leu Gly Lys Leu Val Lys Ser Ser 
50 55 60 

Asp Thr Leu Ser Lys Glu Gin Val Leu Asn lie Arg Asp Leu Thr Arg 
65 70 75 80 

Tyr Asp Pro Gly lie Ala Val Val Glu Gin Gly Arg Gly Ala Ser Ser 
85 90 95 

Gly Tyr Ser lie Arg Gly Met Asp Lys Asn Arg Val Ser Leu Thr Val 
100 105 110 

Asp Gly Val Ser Gin lie Gin Ser Tyr Thr Ala Gin Ala Ala Leu Gly 
115 120 125 

Gly Thr Arg Thr Ala Gly Ser Ser Gly Ala lie Asn Glu lie Glu Tyr 
130 135 140 
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Glu Asn Val Lys Ala Val Glu lie Ser Lys Gly Ser Asn Ser Ser Glu 
145 150 155 160 

Tyr Gly Asn Gly Ala Leu Ala Gly Ser Val Ala Phe Gin Thr Lys Thr 
165 170 175 

Ala Ala Asp lie lie Gly Glu Gly Lys Gin Trp Gly lie Gin Ser Lys 
180 185 190 

Thr Ala Tyr Ser Gly Lys Asp His Ala Leu Thr Gin Ser Leu Ala Leu 
195 200 205 

Ala Gly Arg Ser Gly Gly Ala Glu Ala Leu Leu lie Tyr Thr Lys Arg 
210 215 220 

Arg Gly Arg Glu lie His Ala His Lys Asp Ala Gly Lys Gly Val Gin 
225 230 235 240 

Ser Phe Asn Arg Leu Pro lie Cys Arg Phe Gly Asn Asn Thr Tyr Thr 
245 250 255 

Asp Cys Thr Pro Arg Asn lie Gly Gly Asn Gly Tyr Tyr Ala Ala Val 
260 265 270 

Gin Asp Asn Val Arg Leu Gly Arg Trp Ala Asp Val Gly Ala Gly lie 
275 280 285 

Arg Tyr Asp Tyr Arg Ser Thr His Ser Glu Asp Lys Ser Val Ser Thr 
290 295 300 

Gly Thr His Arg Asn Leu Ser Trp Asn Ala Gly Val Val Leu Lys Pro 
305 310 315 320 

Phe Thr Trp Met Asp Leu Thr Tyr Arg Ala Ser Thr Gly Phe Arg Leu 
325 330 335 

Pro Ser Phe Ala Glu Met Tyr Gly Trp Arg Ala Gly Glu Ser Leu Lys 
340 345 350 

Thr Leu Asp Leu Lys Pro Glu Lys Ser Phe Asn Arg Glu Ala Gly lie 
355 360 365 

Val Phe Lys Gly Asp Phe Gly Asn Leu Glu Ala Ser Tyr Phe Asn Asn 
370 375 380 

Ala Tyr Arg Asp Leu lie Ala Phe Gly Tyr Glu Thr Arg Thr Gin Asn 
385 390 395 400 

Gly Gin Thr Ser Ala Ser Gly Asp Pro Gly Tyr Arg Asn Ala Gin Asn 
405 410 . 415 

Ala Arg lie Ala Gly lie Asn lie Leu Gly Lys lie Asp Trp His Gly 
420 425 430 

Val Trp Gly Gly Leu Pro Asp Gly Leu Tyr Ser Thr Leu Ala Tyr Asn 
435 440 44 5 

Arg lie Lys Val Lys Asp Ala Asp Arg Ala Asp Arg Thr Phe Val Thr 
450 455 460 

Ser Tyr Leu Phe Asp Ala Val Gin Pro Ser Arg Tyr Val Leu Gly Leu 
465 470 475 480 

Gly Tyr Asp His Pro Asp Gly lie Trp Gly lie Asn Thr Met Phe Thr 
465 490 495 

Tyr Ser Lys Ala Lys Ser Val Asp Glu Leu Leu Gly Ser Gin Ala Leu 
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500 505 510 

Leu Asn Gly Asn Ala Asn Ala Lys Lys Ala Ala Ser Arg Arg Thr Arg 
515 520 525 

Pro Trp Tyr Val Thr Asp Val Ser Gly Tyr Tyr Asn lie Lys Lys His 
530 535 540 

Leu Thr Leu Arg Ala Gly Val Tyr Asn Leu Leu Asn Tyr Arg Tyr Val 
545 550 555 560 

Thr Trp Glu Asn Val Arg Gin Thr Ala Gly Gly Ala Val Asn Gin His 
565 570 575 

Lys Asn Val Gly Val Tyr Asn Arg Tyr Ala Ala Pro Gly Arg Asn Tyr 
580 585 590 

Thr Phe Ser Leu Glu Met Lys Phe 
595 600 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asn Lys Lys His Gly Phe Gin Leu Thr Leu Thr Ala Leu Ala Val 
1 5 10 15 

Ala Ala Ala Phe Pro Ser Tyr Ala Ala Asn Pro Glu Thr Ala Ala Pro 
20 25 30 

Asp Ala Ala Gin Thr Gin Ser Leu Lys Glu Val Thr Val Arg Ala Ala 
35 40 45 

Lys Val Gly Arg Arg Ser Lys Glu Ala Thr Gly Leu Gly Lys lie Ala 
50 55 60 

Lys Thr Ser Glu Thr Leu Asn Lys Glu Gin Val Leu Gly lie Arg Asp 
65 70 75 80 

Leu Thr Arg Tyr Asp Pro Gly Val Ala Val Val Glu Gin Gly Asn Gly 
85 90 95 

Ala Ser Gly Gly Tyr Ser lie Arg Gly Val Asp Lys Asn Arg Val Ala 
100 105 110 

Val Ser Val Asp Gly Val Ala Gin He Gin Ala Phe Thr Val Gin Gly 
115 120 125 

Ser Leu Ser Gly Tyr Gly Gly Arg Gly Gly Ser Gly Ala He Asn Glu 
130 135 140 

He Glu Tyr Glu Asn He Ser Thr Val Glu He Asp Lys Gly Ala Gly 
145 150 155 160 

Ser Ser Asp His Gly Ser Gly Ala Leu Gly Gly Ala Val Ala Phe Arg 
165 170 175 

- 71 - 



BNSDOCID: <WO 961 2020A3 JA> 



WO 96/12020 




PCT/US95/13623 



Thr Lys Glu Ala Ala Asp Leu lie Ser Asp Gly Lys Ser Trp Gly lie 
180 185 190 

Gin Ala Lys Thr Ala Tyr Gly Ser Lys Asn Arg Gin Phe Met Lys Ser 
195 200 205 

Leu Gly Ala Gly Phe Ser Lys Asp Gly Trp Glu Gly Leu Leu lie Arg 
210 215 220 

Thr Glu Arg Gin Gly Arg Glu Thr His Pro His Gly Asp lie Ala Asp 
225 230 235 240 

Gly Val Ala Tyr Gly He Asn Arg Leu Ser Val Cys Gly Tyr He Glu 
245 250 255 

Thr Leu Arg Ser Arg Lys Cys Val Pro Arg Lys He Asn Gly Ser Asn 
260 265 270 

He His He Ser Leu Asn Asp Arg Phe Ser He Gly Lys Tyr Phe Asp 
275 280 285 

Phe Ser Leu Gly Gly Arg Tyr Asp Arg Lys Asn Phe Thr Thr Ser Glu 
290 295 300 

Glu Leu Val Arg Ser Gly Arg Tyr Val Asp Arg Ser Trp Asn Ser Gly 
305 310 315 320 

He Val Phe Lys Pro Asn Arg His Phe Ser Leu Ser Tyr Arg Ala Ser 
325 330 335 

Ser Gly Phe Arg Thr Pro Ser Phe Gin Glu Leu Phe Gly He Asp He 
340 345 350 

Tyr His Asp Tyr Pro Lys Gly Trp Gin Arg Pro Ala Leu Lys Ser Glu 
355 360 365 

Lys Ala Ala Asn Arg Glu He Gly Leu Gin Trp Lys Gly Asp Phe Gly 
370 375 380 

Phe Leu Glu He Ser Ser Phe Arg Asn Arg Tyr Thr Asp Met He Ala 
385 390 395 400 

Val Ala Asp His Lys Thr Lys Leu Pro Asn Gin Ala Gly Gin Leu Thr 
405 410 415 

Glu He Asp He Arg Asp Tyr Tyr Asn Ala Gin Asn Met Ser Leu Gin 
420 425 430 

Gly Val Asn He Leu Gly Lys He Asp Trp Asn Gly Val Tyr Gly Lys 
435 440 445 

Leu Pro Glu Gly Leu Tyr Thr Thr Leu Ala Tyr Asn Arg He Lys Pro 
450 455 460 

Lys Ser Val Ser Asn Arg Pro Gly Leu Ser Leu Arg Ser Tyr Ala Leu 
465 470 4 75 480 

Asp Ala Val Gin Pro Ser Arg Tyr Val Leu Gly Phe Gly Tyr Asp Gin 
485 490 495 

Pro Glu Gly Lys Trp Gly Ala Asn He Met Leu Thr Tyr Ser Lys Gly 
500 505 510 

Lys Asn Pro Asp Glu Leu Ala Tyr Leu Ala Gly Asp Gin Lys Arg Tyr 
515 520 525 
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Ser Thr Lys Arg Ala Ser Ser Ser 
530 535 

Tyr Leu Asn Leu Lys Lys Arg Leu 
545 550 

lie Gly Asn Tyr Arg Tyr Val Thr 
565 

Glu Ser Thr Ala Asn Arg His Gly 
580 

Ala Ala Pro Gly Arg Asn Phe Ser 
595 600 



Trp Ser Thr Ala Asp Val Ser Ala 
540 

Thr Leu Arg Ala Ala lie Tyr Asn 
555 560 

Trp Glu Ser Leu Arg Gin Thr Ala 
570 575 

Gly Asp Ser Asn Tyr Gly Arg Tyr 
585 590 

Leu Ala Leu Glu Met Lys Phe 
605 



(2) INFORMATION FOR SEQ ID NO: 11: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



AAACAGGTCT CGGCATAG 18 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGCGAATTCA AACAGGTCTC GGCATAG 27 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGCGAATTCA AAAACTTCCA TTCCAGCGAT ACG 33 



(2) INFORMATION FOR SEQ ID NO : 14 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TAAAACTTCC ATTCCAGCGA TACG 24 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
AAACAGGTCT CGGCATAG 18 



(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGCGAATTCA AACAGGTCTC GG CAT AG 2 7 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGCGAATTCA AAAACTTCCA TTCCAGCGAT ACG 3 3 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TAAAACTTCC ATTCCAGCGA TACG 24 
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WHAT WE CLAIM IS: 

1. An isolated and purified recombinant nucleic acid encoding a 
hemoglobin receptor protein from a Neisseria species. 

2. An isolated and purified recombinant nucleic acid according to Claim 
5 1 , wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 

acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 2. 

3. An isolated and purified recombinant nucleic acid according to Claim 
1 , wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 4. 

10 4. An isolated and purified recombinant nucleic acid according to Claim 

1 , wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 6. 

5. An isolated and purified recombinant nucleic acid according to Claim 
1 , wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 

15 acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 8. 

6. A homogeneous preparation of a hemoglobin receptor protein from a 
Neisseria species. 

7. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 

20 2. 

8. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 
4. 

9. The hemoglobin receptor protein of Claim 6, wherein the protein has 
25 an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 

6. 

10. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 
8. 

30 11. A recombinant expression construct comprising a nucleic acid that 

encodes a hemoglobin receptor protein from a Neisseria species. 
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12. A transformed cell culture comprising the recombinant expression 
construct of Claim 1 1 . 

13. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 

5 sequence that is the amino acid sequence depicted as Seq. I.D. No. 2. 

14. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 
sequence that is the amino acid sequence depicted as Seq. I.D. No. 4. 

15. A recombinant expression construct according to Claim 11, wherein 
10 the nucleic acid encodes a hemoglobin receptor protein having an amino acid 

sequence that is the amino acid sequence depicted as Seq. I.D. No. 6. 

16. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 
sequence that is the amino acid sequence depicted as Seq. I.D. No. 8. 

15 17. A transformed cell culture comprising the recombinant expression 

construct of Claims 13, 14, 15 or 16. 

18. An antibody or antigen-binding fragment thereof that is 

immunologically reactive with an antigenic epitope of a hemoglobin receptor protein 

from a Neisseria species. 
20 19. An antibody according to Claim 18 that is a monoclonal antibody. 

20. An antibody or antigen-binding fragment thereof according to Claim 
18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 2. 

21. An antibody or antigen-binding fragment thereof according to Claim 
25 18 that is immunologically reactive with an antigenic epitope of the hemoglobin 

receptor protein depicted as Seq. I.D. No. 4. 

22. An antibody or antigen-binding fragment thereof according to Claim 
18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 6. 

30 23. An antibody or antigen-binding fragment thereof according to Claim 

18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 8. 
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15 



20 



25 



24. An antigenic epitope of a hemoglobin receptor protein from a 
Neisseria species. 

25. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 2. 

26. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 4. 

27. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 6. 

28. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 8. 

29. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising an antibody according to Claims 18, 20, 21, 22, or 23. 

30. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising an antibody according to Claim 19. 

31. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising the nucleic acid of Claim 1 . 

32. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising the nucleic acid of Claims 2, 3, 4 or 5. 

33. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising an antibody according to Claim 18, 20, 21, 22, or 23. 

34. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising an antibody according to Claim 19. 

35. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising the nucleic acid of Claim 1 or antisense homologue thereof. 
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36. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising the nucleic acid of Claims 2, 3, 4, or 5 or antisense homologue thereof. 

37. A therapeutic agent for treating a disease state in a human, wherein 
5 the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 

comprising the recombinant expression construct of Claims 11, 13, 14, 15 or 16 or 
a homologue thereof that expresses the nucleic acid encoding a hemoglobin receptor 
in an antisense orientation. 

38. An antibody according to Claims 20, 21 , 22 or 23 that is a monoclonal 
10 antibody. 

39. A cell line that produces the monoclonal antibody of Claims 19 or 38. 

40. A method of treating a disease in a human caused by bacteria of a 
Neisseria species, the method comprising the step of administering a therapeutically- 
effective amount of the therapeutic agent of Claims 33, 34, 35, 36, or 37 in a 

15 phannaceutically-acceptable carrier. 

41. A method of diagnosing a disease in a human caused by bacteria of 
a Neisseria species, the method comprising the steps of contacting an amount of a 
detectably-labeled diagnostic reagent of Claims 29, 30, 31, or 32 to a biological 
sample from the human under conditions wherein the diagnostic reagent specifically 

20 binds to the Neisseria bacteria and detecting an amount of the specific binding to the 
biological sample. 

42. A vaccine that is effective in providing immunization against infection 
of a human with a bacteria of Neisseria species comprising a hemoglobin binding 
protein or antigenic fragment thereof. 

25 43. The vaccine of Claim 42 comprising the hemoglobin receptor protein 

of Claims 6, 7, 8, 9, or 10. 

44. The vaccine of Claim 42 comprising a nucleic acid encoding a 
hemoglobin receptor protein from a Neisseria species or antigenic fragment thereof. 

45. A vaccine according to Claim 44 comprising the nucleic acid of 
30 Claims 2, 3, 4, 5, 11, 13, 14, .15, or 16. 

46. The vaccine of Claim 42 comprising cells of the transformed cell 
culture of Claim 17. 
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47. A vaccine according to Claim 46 wherein the cells are attenuated 
bacterial cells. 

48. A vaccine according to Claim 47 wherein the cells are Salmonella 

cells. 

5 49. The vaccine of Claim 42 comprising the epitope of the hemoglobin 

receptor protein of Claims 24, 25, 26, 27 or 28. 
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